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Why Read This Book? 


One of Euclid’s geometry students asked a familiar question more than 2000 years 
ago. After learning the first theorem, he asked, “What shall I get by learning these 
things?” Euclid didn’t have the kind of answer the student was looking for, so he 
did what anyone would do—he got annoyed and sarcastic. The story goes that he 
called his slave and said “Give him threepence since he must make gain out of 
what he learns.”! 

It’s a familiar question: “So how am I ever gonna use this stuff?” I doubt 
that anyone has ever come up with a good answer, because it’s really the wrong 
question. The first question is not what you’re going to do with this stuff, but 
what this stuff is going to do with you. 

This book is not a computer users’ manual that will make you into a computer 
industry millionaire. It’s not a collection of tax law secrets that will save you thou- 
sands of dollars in taxes. It’s not even a compilation of important mathematical 
results for you to stack on top of the other mathematics you have learned. Instead, 
it’s an entrance into a new kingdom, the world of mathematics, where you learn 
to think and write as the inhabitants do. 

Mathematics is a discipline that requires a certain type of thinking and com- 
municating that many appreciate but few develop to a great degree. Developing 
these skills involves dissecting the components of mathematical language, ana- 
lyzing their structure, and seeing how they fit together. Once you have become 
comfortable with these principles, then your own style of mathematical writing 
can begin to shine through. 

Writing mathematics requires a precision that seems a little stifling at first. It 
might feel like some pedant is forcing you to use a limited set of words and phrases 
to express the things you already see clearly with your own mind’s eye. Be patient. 
In time you will see how adapting to the culture of mathematics and adopting its 
style of communicating will shape all your thinking and writing. You will see 
your skills of critical analysis become more developed and polished. My hope is 


1 TL. Heath, A History of Greek Mathematics 1 (Oxford, 1931). 


Why Read This Book? 


that these skills will influence the way you organize and present your thoughts 
in everything from English composition papers to late-night bull sessions with 
friends. 

Here is an analogy of what the first principles of this book will do to you. 
Consider a beginning student of the piano. Music is one of the most creative 
disciplines, and our piano student has been listening to Chopin for some time. She 
knows she has a true ear and intuition for music. However, she must begin at the 
piano by playing scales over and over. These exercises develop her ability to use 
the piano effectively in order to express the creativity within her. Furthermore, 
these repetitive tasks familiarize her with the structure of music as an art form, 
and actually nurture and expand her capacity to express herself in original and 
creative ways through music. Then, once she has mastered the basic technical skills 
of hitting the keys, she understands more clearly how enjoyable music can be. She 
learns this truth: The aesthetic elements of music cannot be fully realized until 
the technical skills developed by rote exercises have been mastered and can be 
relegated to the subconscious. 

Your first steps to becoming a mathematician are a lot like those for our pianist. 
You are going to be introduced to the building blocks of mathematical structure, 
and then you will practice on the precision required to communicate mathematics 
correctly. The drills you perform in this practice will help you see mathematics as 
a creative discipline and equip you to appreciate its beauty. 

Think of this course as a bicycle trip through a new country. The purposes of the 
trip are: 


e To familiarize you with the territory 
e To equip you to explore it on your own 


e To give you some panoramic views of the countryside 


To teach you to communicate with the inhabitants 


To help you begin to carve out your own niche 


If you’re willing to do the work, I promise you will enjoy the trip. Sometimes the 
hills are steep and the pedaling is tough. Be persistent, knowing that it’s worth 
the effort. You will come back a different person, for this material will have done 
something with you. Then you’ll understand that Euclid really got it right after 
all, and you will appreciate why his witty response is still fresh and relevant after 
these 2000 years. 


Preface 


A Transition to Abstract Mathematics was written under the assumption that 
students do not yet know how to read upper level mathematics texts. Since the 
primary purpose of the book is to teach students to write with formal rigor, and 
since I naturally presume they do not yet appreciate exposition written in that 
form, two overriding features of style defined the first edition: a loose and infor- 
mal expository style of writing, and an airtight composition and organization of 
the logic, so that no student could ever say that any necessary detail had been 
overlooked or omitted. Consequently, the scope of the first edition was rather 
narrow and forward focused, where every exercise had an important role in the 
story and there were no characters too peripheral to the plot. 

I believe the second edition maintains the benefits of the first edition’s fea- 
tures but is improved in several ways. First, the exposition is still written to the 
student, but it is tighter and more efficient than before. Second, there are many 
more exercises than in the first edition. Many of these are essential in that they 
are the logical basis of later results. The Instructor's Guide and Solutions Manual 
points out which exercises simply must be either assigned or at least discussed 
because they undergird later results. Others may be assigned, discussed casually, 
or omitted altogether. 

A third and major change to the second edition is that exercises are now inte- 
grated into the flow of the material instead of being placed at the end of each 
section. I believe this arrangement has several advantages. It better facilitates 
the students’ understanding of how the mathematics is built, one step at a time, 
because it requires their continual participation in that process at every step. In the 
second edition, the text speaks clearly to the students and then presents them with 
exercises right on the heels of every new concept. It also should make daily course 
organization easier for the instructor, in that it is always clear which exercises may 
be assigned after a particular day’s class meeting. 

Other changes to the second edition include a reorganization of the material 
that comprised Chapter 2 in the first edition. Introductory proof-writing material 
on set and real number properties has now been divided into two chapters, and the 
order of the material basically reversed from the first edition. Thus the students’ 
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first theorems involve basic algebraic properties of numbers, which might be a 
simpler place for them to begin to write proofs than set properties. Chapter 1 
now includes a section that enumerates different techniques of proof writing, with 
plenty of examples but no expectation that a student yet knows how or in what 
circumstances to employ these techniques. Finally, with exercises integrated into 
the exposition, certain sections that were quite long in the first edition have now 
been divided into more sections of more manageable length. 


Preface to the First Edition 


This text is written for a “transition course” in mathematics, where students learn 
to write proofs and communicate with a level of rigor necessary for success in 
their upper level mathematics courses. To achieve the primary goals of such a 
course, this text includes a study of basic principles of logic, techniques of proof, 
and fundamental mathematical results and ideas (sets, functions, properties of 
real numbers), though it goes much further. It is based on two premises: that 
the most important skill students can learn as they approach the cusp between 
lower and upper level courses is how to compose clear and accurate mathematical 
arguments, and that they need more help in developing this skill than they would 
normally receive by diving into standard upper level courses. By emphasizing 
how one writes mathematical prose, it is also designed to prepare students for the 
task of reading upper level mathematics texts. Furthermore, it is my hope that 
transitioning students in this way gives them a view of the mathematical land- 
scape and its beauty, engaging them to take ownership of their pursuit of 
mathematics. 


Why This Text? 


I believe students learn best by doing. In many mathematics courses it is difficult 
to find enough time for students to discover through their own efforts the mathe- 
matics we would lead them to find. However, I believe there is no other effective 
way for students to learn to write proofs. This text is written to them in a format 
that allows them to do precisely this. 

Two principles of this text are fundamental to its design as a tool whereby 
students learn by doing. First, it does not do too much for them. Proofs are inclu- 
ded in this text for only two reasons. Most of them (especially at the beginning) 
are sample proofs that students can mimic as they write their own proofs to similar 
theorems. Students must read them because they will need the technique later. 
Other proofs are included because they are too much to expect of a student at this 
level. In most of these instances, however, some climactic detail is omitted and 
relegated to an exercise. 
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Second, if students are going to learn by doing, they must be presented with 
doable tasks. This text is designed to be a sequence of stepping stones placed 
just the right distance apart. Moving from one stone to the next involves writing 
a proof. Seeing how to step there comes from reading the exposition and calls on 
the experience that led the student to the current stone. At first, stones are very 
close together, and there is a lot of guidance. Progressing through the text, stones 
become increasingly farther apart, and the guidance gets less explicit. 

I have written this text with a very deliberate trajectory of style. It is con- 
versational throughout, though sophistication and succinctness of the exposition 
increase from chapter to chapter. 


Organization 


This text is organized in the following way. Chapter 0 spells out all assump- 
tions to be used in writing proofs. These are not necessarily standard axioms 
of mathematics, and they are not presented in the context or language of more 
abstract mathematical structures. They are designed merely to be a starting point 
for logical development, so that students appreciate quickly that everything we 
call on is either stated up front as an assumption or proved from these assump- 
tions. Students can probably read Chapter 0 on their own as the course begins, 
knowing that it is there primarily as a reference. 

Part I begins with logic but does not focus on it. In Chapter 1, truth tables and 
manipulation of logical symbols are included to give students an understanding 
of mathematical grammar, of the underlying skeleton of mathematical prose, and 
of equivalent ways of communicating the same mathematical idea. Chapters 2-4 
put these to use right away in proof writing, and allow the students to cut their 
teeth on the most basic mathematical ideas. These chapters will constitute most, 
or perhaps all, of the content of the course. 

Parts IJ and III are two completely independent paths, the former into anal- 
ysis, the latter into algebra. Like Antoni Gaudi’s Sagrada Familia, the unfinished 
cathedral in Barcelona, Spain, where narrow spires rise from a foundation to give 
spectacular views, Parts II and II are purposefully designed to rest on the founda- 
tion of Part I and climb quickly into analysis or algebra. Many topics and specific 
results are omitted along the way, but Parts II and III rest securely on the founda- 
tion of Part I and allow students to continue to develop their skills at proof writing 
by climbing to a height where, I hope, they have a nice view of mathematics. 


Flexibility 


This text can be used in a variety of ways. It is suitable for use in different class 
settings, and there is much flexibility in the material one may choose to cover. 
First, because this text speaks directly to the student, it can naturally be used 
in a setting where students are given responsibility for the momentum of the class. 
It is written so that students can read the material on their own first, then bring to 
class the fruits of their work on the exercises, and present these to the instructor and 
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each other for discussion and critique. If class time and size limit the practicality 
of such a student-driven approach, then certainly other approaches are possible. 
To illustrate, we may consider three components of a course’s activity and arrange 
them in several ways. The components are (1) the students’ reading of the material, 
(2) the instructor’s elaboration on the material, and (3) the students’ work on the 
exercises, either to be presented in class or turned in. When I teach from this text, 
(1) is first, (3) follows on its heels, and (2) and (3) work in conjunction until a 
section is finished. Others might want to arrange these components in another 
order, for example, beginning with (2), then following with (1) and (3). 

Which material an instructor would choose to cover will depend on the pur- 
pose of the course, personal taste, and how much time there is. Here are two 
broad options. 


1. To proceed quickly into either analysis or algebra, first cover the material 
from Part I that lays the foundation. Almost all sections and exercises of Part I 
are necessary for Parts II and III. However, the Instructor’s Guide and Solutions 
Manual notes precisely which sections, theorems, and exercises are necessary 
for each path, and which may be safely omitted without leaving any holes in 
the logical progression. Of course, even if a particular result is necessary later, 
one might decide that to omit its proof details does not deprive the students of a 
valuable learning experience. The instructor might choose simply to elaborate 
on how one would go about proving a certain theorem, then allow the students 
to use it as if they had proved it themselves. 


2. Cover Part I in its entirety, saving specific analysis and algebra topics for later 
courses. This option might be most realistic for courses of two or three units 
where all the Part I topics are required. Even with this approach, there would 
likely be time to cover the beginnings of Parts II and/or III. This might be the 
preferred choice for those who do not want to study analysis or algebra with 
the degree of depth and breadth characteristic of this text. 
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0.1 


Notation and Assumptions 


Suppose you have just opened a new jigsaw puzzle. What are the first things you 
do? First, you pour all the pieces out of the box. Then you sort through and turn 
them all face up, taking a quick look at each one to determine whether it is an 
inside or outside piece, and you arrange them somehow so that you will have an 
idea of where certain types of pieces can be found later. You don’t study each 
piece in depth, nor do you start trying to fit any of them together. You merely lay 
all the pieces out on the table and briefly familiarize yourself with them. This is 
the point of the game where you set the stage, knowing that everything you will 
need later has been put in a place where you can find it when you need it. 

In this introductory chapter, we lay out all the pieces we will use for our work 
in this course. It is essential that you read it now, in part because you need some 
preliminary exposure to the ideas, but mostly because you need to have spelled 
out precisely what you can use without proof in Part I, where this chapter will serve 
you as a reference. Give this chapter a casual but complete reading for now. You 
have probably encountered most of the ideas before. But don’t try to remember 
it all, and certainly don’t expect to understand everything either. That is not the 
point. Right now, we are just organizing the pieces. The two issues we address in 
this chapter are: (1) Set terminology and notation, and (2) Assumptions about the 
real numbers. 


Set Terminology and Notation 


Sets are perhaps the most fundamental mathematical entity. Intuitively, we think 
of a set as a collection of things, where the collection itself is regarded as a single 
entity. Sets may contain numbers, points in the xy-plane, functions, ice cream 
cones, steak knives, worms, even other sets. We will denote many of our sets with 
uppercase letters (A, B, C), or sometimes with scripted letters (F, S, 7). First, 
we need a way of stating whether a certain thing is or is not in a set. 


Definition 0.1.1 If Aisaset and x is an entity in A, we write x € A, and say that 
x is an element of A. To write x ¢ A means that x is not an element of A. 
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How can you communicate to someone what the elements of a set are? There 
are several ways. 


1. List them. If there are only a few elements in the set, you can easily list them 
all. Otherwise, you might start listing the elements and hope that the reader 
can take the hint and figure out the pattern. For example, 


(a) 
(b) 
(c) 


{1, 8, x, Monday} 
{0,1,2,..., 40} 
{..., -6, —4, —2, 0, 2,4,6,...} 


2. Provide a description of the criteria used to define whether an entity is to be 
included. It works like this: 


(a) 


(b) 


{x : xis areal number and x > —1} 

This notation should be read “the set of all x such that x is a real number 
and x is greater than —1.” The indeterminate x is just a symbol chosen to 
represent an arbitrary element of the set, so that any characteristics it must 
have can be stated in terms of that symbol. 


{p/q: pandq are integers and q # 0} 
This is the set of all fractions, integer over integer, where it is expressly 
stated that the denominator cannot be zero. 


{x 2 P(x)} 

This is a generic form for this way of describing a set. The expression 
P(x) represents some specified property that x must have in order to be in 
the set. 


Some of the sets we will use most are the following: 


Empty set: f={} (the set with no elements) 
Natural numbers: N = {1,2,3,...} 
Whole numbers: W = {0,1,2,3,...} 
Integers: Z = {...,—3, —2, -1,0,1,2,3,...} 
Rational numbers: Q={p/q:p.¢d€Z,q 49} 


Real numbers: R (Explained in the next section) 


Nonzero real numbers: R* 


Given two sets A and B, it just might happen that all elements of A are also 
elements of B. We write this as A C B and say that A is a subset of B. Equivalently, 
we may write B > A, and say that B is a superset of A. If A is a subset of B, but 
there are elements of B that are not in A, we say that A is a proper subset of B, 
and write this A C B. 
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0.2 Assumptions about the Real Numbers 3 


Example 0.1.2 Using the sets of numbers defined previously, it follows that 
Nc WandW CZ. Itisalsotruethaté@ CN. 


Anytime we talk about a particular set, there is always a context within which 
the set is assumed to exist. For example, let A be the set of students enrolled in 
your math class, and let B be the set of first-year students at your college who 
are taking beginning French. One possible context for these sets is the set of all 
students at your college. It is as if we have a largest set from which all elements of 
all sets in the current discussion are taken. This largest set is called the universal 
set and is denoted U. We need a universal set in order to define the complement 
of a set. 


Definition 0.1.3 Given aset A, the set A© is called the complement of A and is 
defined as the set of all elements of U that are not in A. That is, 


AC = {x:xe€Uandx¢ A} 


Example 0.1.4 If the real numbers are taken as the universal set, then 
Qo = {x: xe Randx ¢ Q} 


This is the set of irrational numbers. 


Assumptions about the Real Numbers 


One big question we will face when we begin to write proofs is what we are allowed 
to assume and what we must justify with proof. The purpose of this section is to 
lay out all the assumptions we will make concerning the real numbers. In later 
chapters, we will restate these assumptions when we first need to apply them. We 
outline them here for the sake of reference. 


Basic Algebraic Properties 


The real numbers, as well as their familiar subsets N, W, Z, and Q, are assumed 
to be endowed with the relation of equality and the operations of addition and 
multiplication and to have the following properties. First, equality is assumed to 
behave in the following way. 


(A1) Properties of Equality: 


(a) Forallac R,a=a. (Reflexive) 
(b) For alla,be R,ifa=b, then b= a. (Symmetric) 


(c) Foralla,b,c € R,ifa=bandb=c,thena=c. (Transitive) 
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The first property of addition we will assume concerns its predictable behavior, 
even when the numbers involved can be addressed by more than one name. For 
example, 3/8 and 6/16 are different names for the same number. We need to know 
that adding something to 3/8 will always produce the same result as adding it to 
6/16. The following property is our way of stating this assumption. 


(A2) Addition is well defined: For all a, b,c,d €R, if a=b and c =d, then 
at+c=b+d. 


A special case of property A2 yields a familiar principle that goes back to your 
first days of high school algebra. That is the fact that if a = b, then since c = c, we 
have thata+c=b+c. 


(A3) Closure property of addition: For all a,b €¢ R,a+b€R. That is, the sum 
of two real numbers is still a real number. This closure property also holds 
for N, W, Z, and Q. 


(A4) Associative property of addition: For all a, b,c €R,(a+b)+c=at 
(b+). 


Addition is a binary operation, meaning it combines exactly two numbers 
to produce a single number result. If we have three numbers a, b, and c to 
add up, we must split the task into two steps of adding two numbers. Prop- 
erty A4 says it does not matter which two, a and b, or b and c, we add first. It 
motivates the more lax notationa+b+c. 


(A5) Commutative property of addition: For alla,b € R,a+b=b+a. 


(A6) Existence of an additive identity: There exists an element 0 € R with the 
property thata+0=aforallae R. 


(A7) Existence of additive inverses: For all a € R, there exists some b € R such 
that a+ b= 0. Such an element b is called an additive inverse of a and is 
typically denoted —a to show its relationship to a. We do not assume that 
only one such b exists. 


Properties similar to A2—A7 hold for multiplication. 
(A8) Multiplication is well defined: For alla, b, c,d € R,ifa = bandc = d, then 


ac = bd. 


(A9) Closure property of multiplication: For all a,b € R, ab € R. The closure 
property of multiplication also holds for N, W, Z, and Q. 


(A10) Associative property of multiplication: For all a, b,c € R, (ab)c = a(bc). 


(A11) Commutative property of multiplication: For all a,b € R, ab = ba. 


(A12) Existence of a multiplicative identity: There exists an element 1 € R with 
the property thata-1=aforallaeR. 


0.2.2 
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(A13) Existence of multiplicative inverses: For all nonzero aéR, there 
exists some be R such that ab=1. Such an element b is called 
a multiplicative inverse of a and is typically denoted a~! to show 
its relationship to a. As with additive inverses, we do not assume 
that only one such b exists. Furthermore, the assumption that a 
multiplicative inverse exists for all nonzero real numbers does not 
assume that zero has no multiplicative inverse. It says nothing about 
zero at all. 


The next property describes how addition and multiplication interact. 


(A14) Distributive property of multiplication over addition: For every a, b,c € 
R, a(b+ cc) = (ab) + (ac) = ab + ac, where the multiplication is assumed 
to be done before addition in the absence of parentheses. 


Property A14 is important because it is the only link between the operations 
of addition and multiplication. Several important properties of the real numbers 
owe their existence to this relationship. For example, as we will see later, the fact 
that a - 0 = 0 for every real number ais a direct result of the distributive property, 
and not something we simply assume. 

From addition and multiplication we create the operations of subtraction and 
division, respectively. Knowing that additive and multiplicative inverses exist 
(except for 0~!), we write 


a—b=a+(-b) 
a/b =a-b"! 


One very important assumption we need concerns properties A6 and A12. For 
reasons you will see later, we need to assume that the additive identity is different 
from the multiplicative identity. That is, we need the assumption 


(A15) 140. 


We will use these very basic properties to derive some other familiar properties 
of real numbers in Chapter 2. 


Ordering Properties 


One standard way of comparing two real numbers is with the greater than sym- 
bol >. Intuitively, we think of the statement a > b as meaning that a is to the right 
of b on the number line. Though this is helpful, the comparison a > b is actually 
a bit sticky. The nuts and bolts of > are contained in the following. In A16, we 
make an assumption about how real numbers compare to zero by >, thus giving 
meaning to the terms positive and negative. Then in A17 and A18, we make some 
assumptions about how the positive real numbers behave. 
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(A16) Trichotomy law: For any a € R, exactly one of the following is true: 


(a) a> 0,in which case we say a is positive 
(b) a=0 
(c) 0->a,in which case we say a is negative 
(A17) Foralla,b € R,ifa > Oandb > 0,thena + b > 0. That is, the set of positive 
real numbers is closed under addition. 


(A18) For alla, b € R,ifa > 0 and b > 0, then ab > 0. That is, the set of positive 
real numbers is closed under multiplication. 


Now we can use A16—A18 to give meaning to other statements comparing any 
pair of real numbers. 


Definition 0.2.1 Given real numbers a and b, we say that a> b ifa—b> 0. 
The statement a < b means b > a. The statement a > b means that either a > b or 
a = b. Similarly, a < b means either a < bora=b. 


The rest of the properties of real numbers are probably not as familiar as the 
ones above, but their roles in the theory of real numbers will be clarified in good 
time. As with the previous properties, we do not try to justify them. We merely 
accept them and use them as a basis for proofs. A very important property of the 
whole numbers is the following. 


(A19) Well-ordering principle: Any non-empty subset of whole numbers has a 
smallest element. That is, if A is a non-empty set of whole numbers, then 
there is some number a € A with the property that a < x for all x € A. In 
particular, we assume that 1 is the smallest natural number. 


The next property of the real numbers is a bit complicated but is indispensible 
in the theory of real numbers. Read it casually for the first time, but know that 
it will be very important in Part II of this text. Suppose A is a non-empty subset 
of the real numbers with the property that it is bounded from above. That is, 
suppose there is some real number M with the property that a < M for allae A. 
For example, let A = {x: x? < 10}. Clearly, M = 4 is a number such that every 
a eé A sSatisfies a < M. So 4 is an upper bound for the set A. There are other upper 
bounds for A, such as 10, 3.3, and 3.17. The point is that, among all upper bounds 
that exist for a set, there is an upper bound that is smallest, and it exists in the real 
numbers. This is stated in the following. 


(A20) Least upper bound property: If A is a non-empty subset of the real num- 
bers that is bounded from above, then there exists a least upper bound 
in the real numbers. That is, if there exists some M e€ R with the property 
that a < M for all ae A, then there will also exist some L € R with the 
following properties: 
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(L1) For every a € A, we have that a < L, and 
(L2) If N is any upper bound for A, it must be that N > L. 


0.2.3 Other Assumptions 


The real numbers are indeed a complicated set. The final real number properties 
we mention are not standard assumptions, and they deserve your attention at some 
point in your mathematical career. In this text, we assume them. 


(A21) The real numbers can be equated with the set of all base 10 decimal repre- 
sentations. That is, every real number can be written in a form like 
338.1898 ..., where the decimal might or might not terminate, and might 
or might not fall into a pattern of repetition. Furthermore, every decimal 
form you can construct represents a real number. Strangely, though, there 
might be more than one decimal representation for a certain real number. 


You might remember that 0.9999... = 1. The repeating 9 is the only case 
where more than one decimal representation is possible. We will assume 
this. 


Our final assumption concerns the existence of roots of real numbers. 


(A22) For every positive real number x and any natural number 7, there exists a 
real number solution y to the equation y” = x. Such a solution y is called 
an nth root of x. The common notation 2/x will be addressed in Section 3.9. 


Notice we make no assumptions about how many such roots of x there are, or 
what their signs are. Nor do we assume anything about roots of zero or of negative 
real numbers. We will derive these from assumption A22. 

One final comment about assumptions in mathematics is in order. In a rigorous 
development of any mathematical theory, some things must be assumed without 
proof; that is, they must be axiomatic, serving as a starting place for the mathemati- 
cian’s thinking. In a study of the real numbers, some of the assumptions A1l-A22 
are standard. Others would be considered standard assumptions only for some 
subsets of the real numbers, perhaps for the whole numbers. The mathematician 
would then very painstakingly apply assumptions made to the whole numbers in 
order to expand the same properties to all of the real numbers. One assumption 
in particular, A21, is a most presumptuous one. But let us make no apologies for 
this. After all, many of the foundational issues in mathematics were addressed 
very late historically, and this is not a course in the foundations of mathematics. It 
is a course to teach us how mathematics is done and to give us some enjoyment of 
that process. We choose assumptions here that likely coincide with your current 
idea of a reasonable place to start. In some cases, we will dig more deeply as we 
go, though some of the foundational work will come in your later courses. 
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1.1 


Language and Mathematics 


One main purpose of this text is to develop your use of language in the context 
of mathematics. In this chapter, we will lay out some of the principles that govern 
the mathematician’s very precise use of language. 


Introduction to Logic 
Statements 


The first issue we address is what kinds of sentences mathematicians use as building 
blocks of their work. Remember from elementary school grammar, sentences are 
generally divided into four classes: 


Declarative sentences: We also call these statements. Here are some examples: 


. Labor Day is the first Monday in September. 
. Earthquakes don’t happen in California. 


1 

2 

3. 3 is greater than 7. 

4. 28,657 is a prime number. 
5 


. If F, is the nth Fibonacci number and if F,, is prime, then n is prime. 


One characteristic of statements that jumps out at you is that they generally 
evoke a reaction like “Yeah, that’s true,” or “No way,” or even “That could be true, 
but I don’t know for sure.” Statements 1,4, and 5 are true, and statements 2 and 3 
are false. 


Imperative sentences: We would call these commands. 


1. Don’t wash your red bathrobe with your white underwear. 


2. Knock three times on the ceiling if you want me. 


11 
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Interrogative sentences: That is, questions. 


1. How much is that doggy in the window? 
2. Why do fools fall in love? 


Exclamations: 


1. What a day! 
2. Lions and tigers and bears, Oh my! 


3. So, like, whatever. 


The building blocks of the mathematician’s work are statements, but we have 
to be careful about exactly which declarative sentences we allow. We will define a 
statement intuitively as a sentence that can be assigned either to the class of things 
we would call TRUE or to the class of things we would call FALSE. Two pitfalls 
present themselves immediately. 

First, there are paradoxes. For example, “This sentence is false” cannot be 
either true or false. If you think the sentence is true, then it is false. But, if it is 
false, then it is true. We do not want to consider paradoxes as statements. 


EXERCISE 1.1.1 Suppose the barber of Seville is a man who lives in the town 
of Seville. Determine whether each of the following statements is a paradox. 


(a) The barber of Seville shaves every man in the town of Seville who does not 
shave himself. 


(b) The barber of Seville does not shave any man in the town of Seville who 
shaves himself. 


(c) The barber of Seville shaves every man in Seville who does not shave himself, 
but he does not shave any man in Seville who does shave himself. 


Second, some expressions contain what logicians call an indeterminate. The 
presence of an indeterminate in an expression excludes the expression from being 
a statement. For example, to say 


x can be written as the sum of two prime numbers 
is not considered a statement. The use of the indeterminate x is like leaving a 
blank unfilled, so that the expression cannot be classified as either true or false. 


However, if we were to say 


Every integer between 3 and 20 can be written 
as the sum of two prime numbers, 


then this is a statement. 
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Now imagine the set of all conceivable statements, and call it S. Certainly, this 
set is frighteningly large and complex, but a most important characteristic of its 
elements is that each one can (at least in theory) be placed into exactly one of two 
subsets: T (statements called TRUE) and F (statements called FALSE). We want 
to look at relationships between these statements. Specifically, we want to pick 
statements from S, change or combine them to make other statements in S, and 
lay out some understandings of how the truth or falsity of the chosen statements 
determines the truth or falsity of the alterations and combinations. In the next 
part of this section, we discuss three ways of doing this: 


e The negation of a statement 

e A compound statement formed by joining two given statements with AND 

e¢ A compound statement formed by joining two given statements with OR 
Negation of a Statement 


We generally use p, q,r, and so forth, to represent statements symbolically. For 
example, define a statement p as follows: 


p: Meghan has rented a car for today. 


Now consider the negation or denial of p, which we can create by a strategic 
placement of the word NOT somewhere in the statement. We write it this way: 


ap: Meghan has not rented a car for today. 


If p is true, then —p is false, and vice versa. We illustrate this in a truth table, 
Table 1.1. 


p|=p 
T| F (1.1) 
F| T 


Definition 1.1.2 Given a statement p, we define the statement —p (not p) to 
be false when p is true, and true when p is false, as illustrated in Table 1.1. 


Combining Statements with AND 


When two statements are joined by AND to produce a compound statement, we 
need a way of deciding whether the compound statement is true or false, based on 
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the truth or falsity of its component statements. Let’s build these with an example. 
Define statements p and gq as follows: 


p: Meghan is at least 25 years old. 


q: Meghan has a valid driver’s license. 
Now let’s create the statement we call “p and g,” which we write as 


pAq: Meghan is at least 25 years old, and she has a 
valid driver’s license. 


If you know the truth or falsity of p and q individually, how would you be 
inclined to categorize p A q?! Naturally, the only way that we would consider 
pq to be true is if both p and gq are true. In any other instance, we would say 
p Aqis false. So whether p A gisin7 or F depends on whether p and q arein7 or 
F individually. We illustrate the results of all different combinations in Table 1.2. 
Notice how the truth table is constructed with four rows, systematically displaying 
all possible combinations of T and F for p and q. 


p|@a\|pa4q 

T | 2 or 

T|F] F (1.2) 
FIT] F 

F/F] F 


Definition 1.1.3 Given two statements p and q, we define the statement p A q 
(p and q) to be true precisely when both p and q are true, as illustrated in Table 1.2. 


1.1.4 Combining Statements with OR 


Define statements p and g by 


p: Meghan has insurance that covers her for any car she drives. 


q: Meghan bought the optional insurance provided by the car 
rental company. 


! Pretend Meghan is standing at a car rental counter and must answer yes or no to the question “Are 
you at least 25 years old and have a valid driver’s license?” 
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The compound statement we call “p or q” is written 


p\q: Meghan has insurance that covers her for any car she drives, 
or she bought the optional insurance provided by the 
car rental company. 


How should we assign T or F to p V q based on the truth or falsity of p and q 
individually?* We define p v q to be true if at least one of the two statements is 
true. See Table 1.3. 


(1.3) 


Todds 
NTaHimnmryHp 
MHooraAoHi< 


Definition 1.1.4 Given two statements p and q, we define the statement p v qg 
(p or q) to be true precisely when at least one of p and q is true, as illustrated in 
Table 1.3. 


Our conversational language is sometimes ambiguous when we use the word 
OR, and we often use OR in ways that are different from the mathematical use. 
For example, suppose a friend asks you “Did you have a cheeseburger or pizza 
for lunch?” In conversation, you would answer, “Oh, I had pizza,” and your friend 
would likely conclude that you did not have the cheeseburger. In mathemat- 
ics, however, you should answer your friend’s question with a simple yes or no, 
depending on whether you had at least one of these two items. 

To address a statement involving p and q that is true when precisely one of 
them is true, we use the exclusive or, which is written p Vv q. See Table 1.4. 


P|4@|pva 

T|T| F 

TF] T (1.4) 
FIT] T 

F/F] F 


Now we can build all kinds of compound statements. 


2 Pretend Meghan’s friend is worried about being covered in case of an accident. He asks Meghan, “Do 
you have your own insurance, or did you buy the optional coverage provided by the rental company?” 
Under what circumstances should she say yes? 
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Example 1.1.5 Construct a truth table for the statement (p A q) Vv (=p A 79). 


See Table 1.5. 


Solution 


P| qd |7P| 74d | PAG | ~PA~9 | PADV CPA 
T/T| F | F T F T 
T|F| F/T] F F F (1.5) 
F/T] T | F F F F 
F/F|] T | T F T T 
| 


EXERCISE 1.1.6 Use the layout of Table 1.6 to construct truth table columns 


for pA (qV r) and (pA q)V (pAr). 


qVvr | pAqVr) | pA 


PAT | (PAQV (PAD) 


Amy aay HAH HYD 
Amo AaxAa mama HAs 
deunamasa Sx 


(1.6) 


EXERCISE 1.1.7 Construct truth tables for the following statements. 


(a) pV (qvr) 

(b) -(pAq)V (PV Q) 
(c) (pyYqgyr 

(d) =pvq 

(e) pA(@YV =p) 


Logical Equivalence 


There are often several equivalent ways to say the same thing. We need to address 
the situation where two different constructs involving statements should be inter- 
preted as having the same meaning, or as being logically equivalent. As a trivial 
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example, consider that p A q should certainly be viewed as having the same mean- 
ing as q A p. To build a truth table would produce identical columns for p A q and 
q / p. This is the way we define logical equivalence. 


Definition 1.1.8 Two statements are said to be logically equivalent if they have 
precisely the same truth table values. If U and V are logically equivalent, we write 
Us V. 


Notice that the two statements in Exercise 1.1.6 are logically equivalent. To say 
“p AND either g or r” has the same meaning to us as “p and q, OR p and r.” This is 
a sort of distributive property, where A distributes over V, just like multiplication 
distributes over addition in the real numbers. 


EXERCISE 1.1.9 Show that v distributes over A by showing that p v (q Ar) is 
logically equivalent to (p V gq) A (qv r). 


Parts (a) and (b) of Exercise 1.1.7 show that (p v q) v ris logically equivalent to 
pV (q\V r). We say that v has the associative property, and we may allow ourselves 
the freedom to write p V g Vr to mean either (pV gq) Vror pv (qVr). 


EXERCISE 1.1.10 Does A have the associative property? Verify your answer 
with a truth table. 


EXERCISE 1.1.11 Construct a statement using only p, g, A, V, and — that is 
logically equivalent to p V g. Demonstrate logical equivalence with a truth table. 


EXERCISE 1.1.12 Below are two logical equivalences called DeMorgan’s laws 
8g q 

(a name you will want to remember). Verify these forms of DeMorgan’s laws with 

truth tables. 

(a) ~(p Aq) @ =pv 74 

(b) ~(pv q) @ 7p A7q 


With Exercises 1.1.7(a) and (c), 1.1.10, and 1.1.12(a), we can extend one form 
of DeMorgan’s law by using the following manipulation of logical symbols: 


ApAGAT) SAAPAQDATIS APAQV-7r 


(1.7) 


ASV IQ) AI BN Ag VE 


EXERCISE 1.1.13 Mimic the manipulation in (1.7) to show that =(p v q Vr) is 
logically equivalent to =p A 7=q A -r. 


EXERCISE 1.1.14 Use DeMorgan’s laws and symbolic manipulation to show 
that —[(p V g) Ar] is logically equivalent to (—p A 7q) Vv -r. 
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Tautologies and Contradictions 


Sometimes a truth table column just happens to have TRUE values all the way 
down, as in Exercise 1.1.7(b). Another easy example is p Vv =p. A statement like 
“Either Meghan has a valid driver’s license, or she does not” would make you think, 
“Ofcourse!” or “Naturally this is a true statement regardless of the circumstances.” 
A statement whose truth table values are all TRUE is called a tautology. 

The negation of a tautology is called a contradiction, and the truth table values 
of a contradition are all FALSE. In the same way that a tautology is the kind of 
statement that makes you think “Of course!,” a contradiction makes you think “No 
way can that ever be true!” A really easy example of a contradiction is p A —p. 
Since p and —p cannot ever both be true, p A =p is always false. Tautologies and 
contradictions are very useful, as we will begin to see in Chapter 2. 


EXERCISE 1.1.15 Show that the statement (pA q) V (=p V 79g) is a tautology. 


If-Then Statements 


In this section, we want to do two things: (1) Consider the logical structure of the 
statement that p implies or necessitates q and its variations; and (2) return to the 
idea of logical equivalence and its connection to tautologies. 


If-Then Statements Defined 
On the first day of class, your professor makes the following statement: 
Either you turn in your homework on time or you get a zero on it. 


Being the quick-witted logician, you immediately make mental definitions of the 
following statements: 


p: Your homework is late. 


q: You get a zero on your homework. 


and you note that your professor’s statement becomes —p v q. You also recall from 
Exercise 1.1.7(d) that the truth table for —p v ghas only one FALSE entry, and that 
is the entry where p is true but q is false. You think, “Late homework guarantees 
a zero, but even if I turn it in on time, I still might get a zero.” Furthermore, 
you observe that it is natural to rephrase your professor’s statement as an if-then 
statement in the following way: 


If your homework is late, then you get a zero on it. 


This statement is called “If p, then q” or simply “p implies q,” and is written p > q. 
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Definition 1.2.1 The statement p — q is defined to be logically equivalent to 
=p V q,as illustrated in Table 1.8. We call p the hypothesis and q the conclusion. 


(1.8) 


mod als 
mon sis 
Hana 


Example 1.2.2 Construct truth tables for the following statements. 


1. p> 7q 
2. (pAqg—7r 


Solution Remember, the only FALSE entry in an if-then column occurs when 
the hypothesis is true and the conclusion is false. See Tables 1.9 and 1.10. 


Pg | 79g | peg 
FE) TF F 
T/F/|T ai (1.9) 
F/T] F T 
F|F| T T 
P\qaq{|r{|paAq|@MAg->r 
lls Weald OV T 
P| | ee F 
TF Ps] Se T 
T/|F/F] F T (1.10) 
Be | || oF T 
F/T|/F| F T 
F/F/T| F T 
F/F/F| F T 
| 


EXERCISE 1.2.3 Construct truth tables for the following statements. 


(a) -p> -q 
(b) q>p 
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(c) -q>—p 
(4) (PYg>r 
(ec) (P>QMAQG> P) 
(f) p> @vn 


EXERCISE 1.2.4 Verify that each of the following statements is a tautology. 


(a) (p> gApl>4@ (modus ponens) 
(b) [p> QA-q] > =p (modus tollens) 
(c) (pv q) 74] > p (disjunctive syllogism) 
(d) [p> QMAqrenl> pr) (hypothetical syllogism) 


(e) {[(prAg) > rn]Ap}> qn 


Example 1.2.5  Rephrase the following statements in the form “If p, then g.” 


1. Every senior in the class made an A on the final exam. 


2. An integer is prime only if it is greater than 1. 
Solution 


1. If you are a senior in the class, then you made an A on the final exam. 


2. If an integer is prime, then itis greaterthan1. 


EXERCISE 1.2.6 Rephrase the following statements in the form “If p, then q.” 


(a) Eithera > Oora <0. 

(b) Either I’m crazy, or there is a pink elephant floating overhead. 
(c) Either it’s hot in here, or I’m coming down with the flu. 

(d) Every time a bell rings, an angel gets his wings. 

(e) Only fools fall in love. 

(f) Only koalas eat eucalyptus leaves. 

(g) It only hurts when I laugh. 


3 Is this saying that all laughter is painful? 
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2 _ x = 0 are nonnegative. 


(h) The only solutions to the equation x 
(i) Drivers use the carpool lane only if there are at least two people in the car. 
(j) A function is differentiable only if it is continuous. 

(k) U only if V. 
(1) Brett won’t babysit for us unless we pay him minimum wage. 

(m) Unless you follow the instructions, the cake won’t turn out right. 


(n) No shoes, no shirt, no service. 


EXERCISE 1.2.7. The following statements are either in the form p > q or can 
be stated in that form. Rephrase them in the equivalent form —p v q. 


(a) If you don’t cross at the crosswalk, you'll get run over. 

(b) If my boss is telling me the truth, then I will get a bonus this year. 
(c) If you don’t pay your bill, they will shut off your electricity. 

(d) Unless you follow the instructions, the cake won’t turn out right. 
(e) Brett won’t babysit for us unless we pay him minimum wage. 


(f) A function is differentiable only if it is continuous. 


1.2.2 Variations on p > q 


Given two statements p and g, we might want to analyze other possible if-then 
statements besides p —> q. Let’s define the following statements: 


p : Rudy has calculus homework. 
q : Rudy goes to the library. 


Letting p — q be our primary statement, we can put p and q together in other 
important ways. 


p> If Rudy has calculus homework, 
then he goes to the library. (Primary statement) 


q>p If Rudy goes to the library, 


then he has calculus homework. (Converse) 
=p—>-4q If Rudy does not have calculus homework, 
then he does not go to the library. (Inverse) 


=q—> 7p If Rudy does not to go the library, 
then he does not have calculus homework. (Contrapositive) 
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Example 1.2.8 In Exercise 1.2.3, you constructed truth tables for g > p, =p > 
aq, and —q — —p. The truth table values are displayed in Table 1.11. Notice that 
p — qis logically equivalent to its contrapositive, and its converse and inverse are 
logically equivalent. 


pP|qa|-p|-a|p>4a\|qa>p|-p> 74a | -a> =p 
T|T| F | F T T T T 
T|F| F|T F T T F (1.11) 
F/T| T | F T F F T 
BE |. a ee T T T T 

= 


EXERCISE 1.2.9 Suppose x represents some specific, but unknown real number. 
State the converse, inverse, and contrapositive of the following statement: 


Ifx >1,thenx?—x>0. 


An important construct involving if-then is the statement that is true precisely 
when p and q are either both true or both false. For example, if an integer is 
divisible by 6, then it is divisible by both 2 and 3. Conversely, if an integer is 
divisible by both 2 and 3, then it is divisible by 6. We say that an integer is divisible 
by 6 if and only if it is divisible by both 2 and 3. 


Definition 1.2.10 Given statements p and gq, the statement p < q (read “p if 
and only if g,” and often written “p if q”) is defined to be true precisely when p 
and q are either both true or both false. 


EXERCISE 1.2.11 Construct a truth table for (p > q) A (q > p) to show that 
it is logically equivalent to p = q. 


.2. et p, q, and r represent, respectively, the statements that 
EXERCISE 1.2.12 L dr rep pectively, th h 
Penelope, Quentin, and Rhonda studied for the math test. Using p, g, 7,7, A,V,V, 
—,and <, translate the following sentences into a symbolic logical construction. 
(a) Rhonda studied for the test, but Penelope did not. 

(b) If Rhonda studied for the test, then so did Penelope. 

(c) If Rhonda did not study for the test, then Penelope and Quentin didn’t either. 
(d) Quentin and Rhonda either both studied or both did not study. 

(ec) Neither Penelope nor Rhonda studied for the test. 

(f) If Penelope did not study for the test, then neither did Rhonda. 
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(g) If either Penelope or Rhonda studied for the test, then so did Quentin. 


(h) Precisely one of Penelope and Quentin studied for the test. 


EXERCISE 1.2.13 Which two of the statements in Exercise 1.2.12 are logically 
equivalent to each other? 


Logical Equivalence and Tautologies 


In Section 1.1, we defined two statements to be logically equivalent if they have 
exactly the same truth table values. With the definition of p < q, we can now offer 
an alternate definition of logical equivalence. 


Definition 1.2.14 Two statements U and V are logically equivalent if the 
statement U + V is a tautology. 


As we noted in Section 1.1.5, the significance of statements being logically 
equivalent is that they are different ways of saying precisely the same thing. If U 
is logically equivalent to V, then knowing U is true guarantees that V is true, and 
vice versa. 

If U + V is a tautology, then U — V and V > U must both be tautologies, 
too. Loosely speaking, the truth of U is sufficiently strong to imply the truth of V 
and vice versa. 


Example 1.2.15 Let U be the statement p — q and let V be the statement 
aq — —p. Show that U and V are logically equivalent using Definition 1.2.14. 


Solution We already know that the truth table columns for p > g and -7=q > —p 
are identical, but we are asked to use Definition 1.2.14. So we construct (p > q) © 
(-—q —> =p) and show that it is a tautology. The details are displayed in Table 1.12. 


P\lq\|rpilrq|Ul|VvVi|UuoV\i|Vou|UnVAWV>U) 
T|T| F F/T |T T T T 
T|F| F T | F/F T T T 
F/T | T F/T |T T T T 
F)F{ T T | T/T T T T 


(1.12) 


Since the last column of Table 1.12 is a tautology, U and V are logically equiv- 
alent. Notice that the columns U > V and V > U are tautologies to make the 
last column a tautology. 


Now let’s consider the situation where U < V is not a tautology, but one of 
U—> VorV > Uis. IfU > Visa tautology while V > U is not, then we say that 
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U is a stronger statement than V. This means that the truth of U necessitates the 
truth of V, but the truth of V is not necessarily accompanied by the truth of U. 


Example 1.2.16 Which statement is stronger, p or p Aq? Verify with a truth 
table. 


Solution See Table 1.13. Since (p A gq) > p isa tautology while p > (p Aq) is 
not, p A q is stronger than p. Knowing p A q is true guarantees that p is true, but 
knowing p is true does not guarantee that p A q is true. 


P|a|PAG| PAD>P | P>(PAQ) 
TIT] T Tr T 
TIF] F 7 F (1.13) 
F/T| F T T 
F/F| F T T 
a 


Example 1.2.17 Which statement do you think is stronger, (p A q) > ror p > 
r? Determine for sure with a truth table. 


Solution Writing (p A gq) > ras U and p > ras V, we need truth table values 
for U > Vand V => U. (See Table 1.14.) Since V > U isa tautology and U > V 
is not, V is stronger than U. 


< 
< 


AHAn eSB vA HA 
onl 


U:(pAgro>r|V:pror|U 
T T 


Q 


Toma aa als 
Toons dis 
THmanmeansls 
Tata als 
HHxsaHeaD 

HHxataD 

ae ne 


(1.14) 
P| 


Notice this important fact. Statements U and V from Example 1.2.17 have the 
same conclusion, but the hypothesis condition for U is stronger than the hypothesis 
condition for V. This makes U a weaker statement than V. 

What is the significance of one statement being stronger than another? Here 
is an example to illustrate Example 1.2.17. Define the following statements. 
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p : Meghan is at least 25 years old. 
q : Meghan has a valid driver’s license. 


r : Meghan is allowed to rent a car. 


What does it mean to say that p — r is stronger than (p A q) > r? The statement 
(p Aq) > rsays that age and a license will guarantee your eligibility to rent a car, 
while p — r says that age alone is sufficient to be eligible. So if everyone at least 25 
years old can rent a car, then certainly everyone at least 25 with a license can, too. 
Thus if p — r is true, so is (p A gq) > r. On the other hand, just because licensed 
people at least 25 years old can rent a car, it does not follow that all people over 
25 can do the same. That is, the truth of (p A gq) > r does not imply that p > r 
is true. 


EXERCISE 1.2.18 Foreachof the following pairs of statements, use a truth table 
to determine whether they are logically equivalent, or that one is stronger than 
the other. 


(a) p PYG 

(b) por (pyvqy>r 

(c) por (p> QgAq>r) 
(d) (pvgor (p>nVvV(q>7r) 
(e) (pygror (p> rnA(q>nr) 
(f) p> @Ar) (p> QA(p>?r) 
(g) pvq (PV QA7~q 

(h) p> vr) (pA7q)> Fr 

(i) (PAg>r (pA nr) > 7q 
(Gj) PeQArss) (pvr) (qVs) 
(k) pog Ap eG 


EXERCISE 1.2.19 This exercise investigates the general principle that an impli- 
cation statement is strengthened when its hypothesis condition is weakened. 


(a) Suppose U and V are two statements, and U is stronger than V. What must 
be true about the truth table entries for U as compared to those for V?4 


(b) If U is stronger than V, why does that make U — r weaker than U > r?° 


4 The set of F entries for one must be a proper subset of the F entries for the other. 
5 Use your answer to part (a), and the definition of p > q. 
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(c) Which of the following pairs of statements is stronger? Explain without using 


a truth table. 


(i) p>r and (pvqgy>r 
Gi) [pPAqq>s] >t and {paA[qAn- s]}} >t 


Example 1.2.20 Without justifying by proof, state whether statements in each 
of the following pairs are logically equivalent, whether one is stronger than the 
other, or neither. 


1. 
p: xis an integer that is divisible by 6. 
q: xis an integer that is divisible by 3. 
2. 
p: P—4x7+4x =0 
q: x € {0, 2, 4} 
3. 
p:—2<x<2 
ee | 
4, 
p:xisan integer multiple of 10. 
q: xis an integer multiple of 15. 
Solution 
1. If an integer is divisible by 6, then it is divisible by both 2 and 3. However, 


divisibility by 3 does not necessitate divisibility by 6. For example, 9 is divisible 
by 3 but not divisible by 6. So p is stronger than gq. 


. Factoring and solving the equation in p yield x = Oorx = 2. Soifx = Oorx = 2, 


then x € {0, 2, 4}. But x = 41s not a solution to the equation x — 4x2 + 4x = 0. 
Thus if x € {0, 2, 4}, it does not follow that x? — 4x? + 4x = 0. Therefore pis 
stronger than q. 


. A particular value of x will either make p and q both true or both false. So they 


are logically equivalent. 


. If x is an integer multiple of 10, it might or might not be an integer multiple 


of 15, as is illustrated by x = 20 and x = 30. Furthermore, just because x is an 
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integer multiple of 15, it might or might not be an integer multiple of 10, as is 
illustrated by x = 30 and x = 45. Thus neither of these statements is stronger 
than the other. Hf 


EXERCISE 1.2.21 Suppose N is some unknown real number and f is a given 
function. Without justifying by proof, state whether statements in each of the 
following pairs are logically equivalent, whether one is stronger than the other, or 
neither. 


(a) 
N is a natural number. 
N is an integer. 
(b) 
N>5 
N> 10 
(c) 
If N>5,then f(N) > 0. 
If N > 10, then f(N) > 0. 
(d) 


If N > 5 and Nis an integer, then f(N) > 0. 
If N > 10, then f(N) > 0. 


Universal and Existential Quantifiers 


Most of the examples of if-then statements in Section 1.2 were not about math- 
ematical objects, but about people, situations, and behaviors. The reason is that 
mathematical if-then statements generally involve indeterminates (as the state- 
ments defined on p. 21), and these are inextricably tied to the presence of what 
we call universal and existential quantifiers. 

We might make a statement like: 


All squares are rectangles, 
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which we might phrase as 
If x is a square, then x is a rectangle, 
but which a more formal logician would express as 


For all x, if x is a square, then x is a rectangle. 


This last statement might sound odd, but you will see in Section 1.4 why the 
presence of “for all x” is important. In this section, we will express statements in 
a variety of ways, each of which is natural and appropriate in a certain context. 


1.3.1. The Universal Quantifier 


The inequality n? < 2” is true for all natural numbers n > 5. (You will prove this 
in Section 3.4.) We could express this in any of the following ways. 


For all natural numbers n, if n > 5, then n? < 2”. 
For all n,ifn € Nandn > 5, then n? < 2”. 
(Wn)[n eNAn>5) > (nr? <2"). 


These statements are arranged in increasing order of formality. The last one 
uses the symbol V (for all), which is called the universal quantifier. 


Example 1.3.1 The following are all acceptable ways of making the same 
universal statement. 


Every element of the set B is negative. 
For all x € B, x < 0. 

(Vx € B)(x < 0). 

For all x, if x € B, then x < 0. 


(Vx\(xEeBr>x<0). 


All five of the statements in Example 1.3.1 have their place in mathematical 
discourse, from the very informal first one to the very formal last one. Rather 
than think of the last statement as a formalization of the first, however, it is most 
helpful to think of the first as a more conversational equivalent of the last. 
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Example 1.3.2 Suppose f is a given function. Then the following statements 


are equivalent. 


The graph of f does not intersect the x-axis. 


For all real numbers x, f (x) 4 0. 


(Vx € R)(f(@) #9). 
For all x, if x € R, then f (x) 4 0. 
(Vx)(xER—> f(x) £0). 


The most general form for a statement involving the universal quantifer would 
look something like this. If P(x) is some property stated in terms of x, such as 
“xe R—- f(x) 40,” then a general statement involving the universal quantifier 
would be written 


(Wx) (P(x)) (1.15) 
and would be read, “For all x, P(x).” 1 
EXERCISE 1.3.3 Let D be the set of all differentiable functions and C the set 


of all continuous functions. Construct several statements (as in Examples 1.3.1 
and 1.3.2) that use the universal quantifier and are equivalent to the statement 


If a function is differentiable, then it is continuous. 


EXERCISE 1.3.4 Construct several statements that use the universal quantifier 
and are equivalent to the statement 


The only real number solutions to the equation x” — x = 0 are nonnegative. 


The Existential Quantifier 


The inequality n* < 2” is true forn > 5, but notif 1 <n < 4. In other words, there 
exist natural numbers n (at least one such number) such that n? > 2”. 

There exists a natural number n such that n? > 2”. 

There exists n such that n € N and n? > 2”. 

(An)(n EN An? > 2"), 


The expression “there exists,” written mathematically as J, is called the existential 
quantifier. 
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Example 1.3.5 The following are all acceptable ways of making the same 


existential statement. 


Some element of the set B is positive. 
There exists x € B such that x > 0. 

(ax € B)(x > 0). 

There exists x such that x € Band x > 0. 


(Ax)(x€ BAx > 0). 


The most general form of an existential statement involving a property P(x) 
would be written 


(ax) (P(x) (1.16) 
and would be read, “There exists x such that P(x).” I 
EXERCISE 1.3.6 Let D be the set of all differentiable functions and C the set of 


all continuous functions. Construct several statements (as in Example 1.3.5) that 
use the existential quantifier and are equivalent to the statement 


Some continuous functions are not differentiable. 


EXERCISE 1.3.7 Construct several statements that use the existential quantifier 
and are equivalent to the statement 


The equation x” + x = 1 has a negative real number solution. 


Now the plot thickens. 


Example 1.3.8 Suppose F is a set of functions. Here are three equivalent 
statements. 

The graph of every function in F intersects the x-axis at least once. 

For all f € F, there exists x € R such that f (x) = 0. 

Whe F)@xeR(f@=09). g 


EXERCISE 1.3.9 Let F be a set of functions. Restate the following using 
universal and existential quantifiers. 


1.3. Universal and Existential Quantifiers 31 


(a) There is a function in F whose graph intersects the x-axis. 
(b) No function in F has a graph that intersects the x-axis. 


(c) There is a function in F whose graph does not intersect the x-axis. 


Sometimes a universal statement might be expressed informally by placing 
the V phrase after the property that is universally true. This is merely a way to 
make a sentence sound more natural. We would not write the statement that way 
symbolically. The next example illustrates this with a range of expressions from 
informal to formal. 


Example 1.3.10 The following statements are equivalent. 


There is an element of set A that is less than every element of set B. 
There exists x € A such that x < y forall ye B. 
There exists x € A such that, for all y € B, x < y. 


(Ax € A)(Vy € B)(x < y). & 


EXERCISE 1.3.11 For each of the following statements, use your creativ- 
ity and construct several statements that are logically equivalent to the given 
statement. 

(a) No element of the set A exceeds m. 

(b) Some element of the set A exceeds m. 

(c) A contains an element that is greater than every element of B. 

(d) Every element of A is greater than every element of B. 

(e) There is some element of A that is greater than some element of B. 

(f) Every element of A is greater than some element of B. 

(g) There is no real number that is in both sets A and B. 


(h) Every element of the set A can be written as the sum of the squares of two 
integers. 


(i) The equation x* + 1 = 0 has no real number solution. 
(j) Every real number has an additive inverse. 
(k) Ifn is an odd integer, then n is an odd integer. 


(1) There is a real number that does not have a multiplicative inverse. 


32 


1.3.3 


Chapter 1 Language and Mathematics 
Unique Existence 


Sometimes it is important to know not only that something exists, but also that 
exactly one such thing exists. If there exists exactly one thing with a certain 
property, we say that it exists uniquely. The mathematical statement 


(Alx)(P (x) (1.17) 


is read, “There exists a unique x such that P (x).” 

To define unique existence, we must amend the statement (4x) (P (x)) to include 
the additional stipulation that there is no more than one such x. The standard way 
of defining unique existence is the following. 


Definition 1.3.12 We say that there exists a unique x with property P provided 
two things are true. First, there exists x with property P, and second, for all x; and 
x2, if x1 and x2 both have property P, then x; and x2 are the same. In other words, 
the logical statement (A!x)(P (x)) is defined by 


[(Ax)(P (x))] A [0Vx1) (x2) (P (x1) A P(x2)) > 1 = x2)] (1.18) 


Unique existence is really a compound AND statement. The first component 
says that at least one thing with the property exists, and the second component 
says that at most one thing with the property exists. 


Example 1.3.13 Reword the following statements of unique existence in a form 
like that in Definition 1.3.12. 
1. There exists a unique real number x such that x? = 8. 


2. For alla € R, there exists a unique b € R such that a+ b = 0. 
Solution 


1. There exists a real number x such that x? = 8, and for all x, x2 € R, if e =8 
and oe = 8, then xj = x2. 


2. For all a € R, there exists b € R such that a+ b= 0, and for all b,, b2 € R, if 
a+b, =Oanda+b2 =0,then bj = bp. 


When we write these unique existence statements in prose, we can give ourselves 
the freedom to omit the universal quantifier phrase, knowing it is understood. So 
we could reword these slightly as 


1. There exists a real number x such that x* = 8, and if x1,x. € R satisfy xf =8 
and x = 8, then xj = x2. 
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2. For all a € R, there exists b € R such that a+ b = 0, and if bi, b2 € R satisfy 
a+b; =Oanda+bh,=0,thenb; =). Hf 


EXERCISE 1.3.14 Reword each of the following unique existence statements 
in a form like that in Definition 1.3.12. 

(a) The set A contains precisely one element. 

(b) The function f crosses the x-axis exactly once. 

(c) There is a unique function f such that f(0) = 1 and f’(x) = f(x) for allx € R. 
(d) There exists a unique natural number n such that n > 1 and nis a factor of p. 


(e) The function f(x) = x? — x — 1 crosses the x-axis precisely once in the interval 
O<x <2. 


(f) The curves x* + y* = 1 and 15y = (x — 11/5)? intersect at exactly one point 
in the xy-plane. 


Negations of Statements 


The defining characteristic of the negation of a statement is that the truth table 
values are exactly the opposite. In this section, we see how to construct negations 
of compound statements. Then, if someone makes an ugly statement like 


(pq) > (rVs) 


and we want to say to that person, “Nope, you’re wrong,” then we’ll have a way of 
writing 
“Tp Ag > (rv s)] 


in a more useful and understandable form. 


Negations of AND and OR Statements 


In Exercise 1.1.12, youshowed that —(p A q) is logically equivalent to =p v -q, and 
that =(p v q) is logically equivalent to =p A -q. These facts are called DeMorgan’s 
laws, and they provide us a way of expressing the negations of A and v compound 
statements. 


Example 1.4.1 Construct a negation of the following statements. 


1. pAqgAr 


2. (PDVQA(rvVs) 
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Solution Applying DeMorgan’s laws, we have 


1. A(pAqAr) @& 3apv7-qv rr 


2. -TpVQgMArVS] &ApVAQVArVs) & (ApArqgdVv(FraAns) Hf 


Example 1.4.2 Use DeMorgan’s laws to express in words a negation of the 
following statements. 


1. Jacob has brown hair and blue eyes. 
2. Either I’m crazy or there is a pink elephant floating overhead. 


3. Meghan is at least 25 years old, she has a valid driver’s license, and she 
either has her own insurance or has purchased coverage from the car rental 
company. 


Solution Don’t hesitate to word the statements in a way that makes them easy 
to understand. 


1. Either Jacob does not have brown hair, or he does not have blue eyes. 
2. Iam not crazy, and there is no pink elephant floating overhead. 


3. Either Meghan is under 25, or she doesn’t have a valid driver’s license, or she 
has no insurance of her own and has not purchased coverage from the car rental 
company. Mf 


EXERCISE 1.4.3 State in words a negation for each of the following state- 
ments. 


(a) Either Melanie is naturally blond, or she bleaches her hair. 
(b) Eric has a boarding pass and either a driver’s license or passport. 


(c) Either f is not continuous or it crosses the x-axis at some point. (f is a given 
function.) 


(d) Been there; done that. 


Negations of If-Then Statements 


In Section 1.2, we defined p — gq to be logically equivalent to sp v q. To construct 
a negation of p > q, we can use this fact with DeMorgan’s law. 


“(Pp > De TCPYV 9) & (WTP) Amd > PAE 
This might be a little confusing at first, but later you will want to think of it in 


the following way. If someone makes a claim that p implies q, then he or she is 
claiming that the truth of p is by necessity accompanied by the truth of q. If you 
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want to negate that claim, then you are suggesting that p can be true, while q is 
false. The techniques that follow for negating universal and existential statements 
will help clear that up. 

Example 1.4.4 Construct a negation of the following statements. 

1. p> (qAnr) 

2. (p> qgv(r->s) 


Solution 
1. -[p> GAN] S PATA) & PA (“Gv 71) 
2, Apr gnvr>s)erpr Qantas) e(pAr-gA(rAns) ff 


Example 1.4.5 State in words a negation of the following statements. 


1. If Trent is a senior in the class, then he is exempt from the final exam. 
2. If Meghan has rented a car, then she either has her own insurance or has bought 
coverage from the car rental company. 


Solution 


1. Trent is a senior in the class, and he is not exempt from the final exam. 
2. Meghan rented a car, but she neither has her own insurance nor did she 
purchase coverage from the car rental company. 


EXERCISE 1.4.6 Use the results of this section to construct the symbolic 
negation of the following statements. 

(a) PAV) 

(b) p> @vr) 

(c) PYgo>r 

(d) p> QAP>r) 

(ec) (pAg>r]>s 

(f) pog 


EXERCISE 1.4.7 State in words a negation for each of the following statements. 


(a) If Brett is on the guest list, then he will come to the party. 


(b) If f is continuous, then it crosses the x-axis at some point. (f is a given 
function.) 


(c) No shoes, no shirt, no service. 
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Negations of Statements with the Universal Quantifier 
Suppose someone makes the following claim. 

Every person in the class passed the first exam. 


For you to deny this claim, what sort of statement would you make?® You would 
probably say something like 


At least one person in the class failed the first exam. 


This is where the most formal version of the universal statement comes in handy. 
Let C be the set of students in the class and let P be the set of students in the class 
who passed the first exam. We can write the original statement as 


(Vx)(x€C>xe P) (1.19) 


and its negation would be 
(Ax)(x ECA x ¢ P) 


Notice in the negation that (x € C Ax ¢ P) isa negation of the statement (x € 
C — x € P). Using the property notation P(x), we can therefore say the following. 


=LVx) (P(x))] <> (Ax) (F P(x) (1.20) 


So the trick to negating a universal statement is that the — symbol crawls over the 
(Vx) and converts it to (Ax) as it goes. 
Another symbolic rendering of statement (1.19) would be 


(Vx € C)\(x € P) 


whose negation would be 
(Ax € C)(x ¢ P) 


Notice that the (Vx € C) becomes (Ax € C) as the — symbol crawls over it, then 
the statement (x € P) is negated. 


Example 1.4.8 State in words a negation of the following statements. 


1. For all x € Z,x? > 0. 
2. For all x € Z, if x is divisible by 6, then x is divisible by 3. 


3. Every real number is either rational or irrational. 


© One slacker is all it takes. 
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Solution 


1. There exists x € Z such that x2 < 0. 

2. There exists x € Z such that x is divisible by 6 but not divisible by 3. 

3. There exists a real number that is neither rational nor irrational. 
Warning: Universal statements are often written in if-then form with the uni- 

versal quantifier omitted. This is perfectly acceptable in mathematical discourse, 


but the presence of the universal quantifier must be understood if you are going 
to negate the statement correctly. 


Example 1.4.9 A mathematical statement such as 
Ifx > 1, then x° —x > 0. 
is an acceptable simplification of the more formal 
For all x, if x € Rand x > 1,thenx* —x > 0. 


even though it contains the indeterminate x and does not specify that the context 
is the real numbers. However, the formalized version must be understood in order 
to construct the correct negation: 


There exists a real number x such that x > 1 and x° — x < 0. | 


EXERCISE 1.4.10 State in words a negation for each of the following state- 
ments. 


(a) Ifn is anatural number and n > 5, then n* < 2”, 


? _ x = 0 are nonnegative. 


(b) The only solutions to the equation x 
(c) Ona clear day, you can see forever. 
(d) If —1 <x < 10,then x? < 100. 

(e) Every time a bell rings, an angel gets his wings. 


(f) No one in the class made an A on the final. 


Negations of Statements with the Existential Quantifier 


Now we negate statements involving the existential quantifier. Consider the 
following. 


Someone traveling in my car didn’t chip in enough money for gas. 
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If we let C be the set of people riding in my car and T be the set of tightwads, the 
statement becomes 

(Ax)(xECAxET) 
To negate this, we avoid saying “no one” if possible. It is better to say what everyone 
did instead of what no one did. We could say 


Everyone traveling in my car chipped in enough money for gas, 


or 
(vx € C)(x € T) 
which is equivalent to 
(Vx)\(xECox¢€T) 
Notice in the negation that (x € C > x ¢ T) is the negation of (xE€ CAx€T). 
Thus we arrive at the following. 


[x) (P@))] <> (Vx) (F P(x) (1.21) 


This looks similar to statement (1.20) in that negating an existential statement can 
be done by letting the — symbol crawl over the J, changing it to V as it goes. 

If someone says to you that a certain something exists, and it is a claim you 
want to deny, then avoid using the expression “There does not exist.” Instead, 
express your negation in positive language by saying that all things fail to have 
the desired property. 


Example 1.4.11 Construct a negation of the following statements. 


1. Gx eN)\(x < 0) 
2. (Ve > 0)(an E N)(1/n < ©) 


Solution 


1. (Wx e N)( > 0) 
2. (de>O)(Wn EN)A/n>c) 


Example 1.4.12 State in words a negation of the following statements. 


1. Someone in this class cheated on the final exam. 

2. There exists a natural number x such that x < y forall y EN. 
3. There exists M € R such that x < M forall x € S. 
4. 


For all ae A and ¢ > 0, there exists 6 > 0 such that, if 0 < |x —a| < 6, then 
| f(x) — L| <e. 
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Solution 


. Everyone in this class did not cheat on the final exam. 
. For every natural number x, there exists a natural number y such that x > y. 


. For all real numbers M, there exists x € S such that x > M. 


BR WO NO RF 


. There exists a € A and € > Osuch that, for all 6 > 0, there exists an x satisfying 
0 <|x-—a| <édand|f(x)-—L|>e«. Hf 


EXERCISE 1.4.13 Construct a negation of the following statements. 


(a) a>1Aa<10 

(b) a>120Va<80 

(c) (Wx)? — 4x +3=0> x>1) 
(d) Ax) > 2A x? —3x =2) 

(e) (Va € R)\b € R\(a+b =0) 
(f) (lx)(P@)) 

(g) (Va € R)\Alb € R)\(a+b =0) 
(h) (im € A)(Vx € A)(x < m) 


(i) (Vx)(xe A> xe B) 
(Gj) G@)@eAAxe B) 
(k) There is an element of set A that is less than every element of set B. 
(1) Every element of A is greater than every element of B. 
(m) There exists an element of the set A that exceeds m. 
(n) The graph of every function in F intersects the x-axis at least once. 
(o) There is a function in F whose graph intersects the x-axis. 
(p) No function in F has a graph that intersects the x-axis. 
(q) There is a function in F whose graph does not intersect the x-axis. 
(r) The equation x? = 10 has a real number solution. 
(s) There exist M1, M2 € Rsuch that Mj <x < Mp forallxe S. 
(t) There exists a unique x € A. 
(u) Every real number has a multiplicative inverse. 


(v) Every integer is either even or odd. 
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How We Wrrite Proofs 


A theorem is a statement of the form U — V. To “prove” a theorem in the most 
abstract sense is to show that U > V is a tautology, for then we know that U is at 
least as strong as V. Thus, as we say, V follows from U, meaning that any context 
in which U is true will guarantee that V is also true. 

Only the most theoretical mathematicians approach theorems as tautologies, 
and even then such mathematicians are probably more interested either in the 
philosophy of mathematical reasoning or in formal methods as it relates to fields 
such as computer science. In this text, we do hope that some mathematical phi- 
losophy rubs off on you. But we are primarily interested in proofs as almost all 
mathematicians write them. Thus we focus on what it means to write a proof in 
mathematical prose, so that it is accurate, clear, and readable. However, just so you 
will not feel cheated, we will provide one example (in Section 3.2) of a formalized 
theorem proved by demonstrating it is a tautology. 

If we are not going to use tautologies in our proof writing, then why have we 
spent time studying logic and truth tables in this chapter? There are two primary 
reasons. One is that tautologies provide us with rules for valid reasoning that will 
undergird our mathematical writing. For example, in Exercise 1.2.4 you showed 
that [(p > q) A p] > qisa tautology. This tautology is called the modus ponens, 
and it is one of several formalized rules of inference that comprise mathematical 
reasoning. If we assume p and that p > q, then the modus ponens rule of inference 
says we may therefore conclude q. It is not our purpose here to define all the rules 
of inference, though you have seen most of them at some point in this chapter. 
Exercise 1.2.4 contains the names of several rules of inference. 

The second reason we have studied logic and truth tables is that it will provide 
us with approaches to the writing of proofs. In particular, we need to understand 
how to dissect mathematical statements (the term is to parse sentences) and see 
how they give order and structure to our proofs. We also need to know that we can 
sometimes write proofs by exploiting equivalent statements. Furthermore, a basic 
understanding of logic can immunize us from making certain fallacies —statements 
that are not tautologies and therefore not valid theorems. 

In this section, we want to outline the four types of demonstrations you will 
employ throughout this text when you are asked to compose a proof of a mathe- 
matical proposition. Know that you will always use one of these methods of attack 
in any demonstration. Knowing which to choose depends on what your task is, 
and is illumined by experience. 


Direct Proof 


Most theorems are proved directly, which means that the statement of the theorem 
and its proof fit into the following template. 


Theorem 1.5.1 (Sample). If p, then g. 


Proof. Suppose p. Then ....Thus g. 


1.5.2 
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Our purpose here is only to see where a direct proof begins and ends. The “dot 
dot dot” is what varies from situation to situation. Here is an example theorem. 


Theorem 1.5.2 (Sample). For all integers x and y, if x and y are both odd, then 
xy is odd. 


Proof. Suppose x and y are both odd integers. Then .... Therefore xy is odd. 


Theorem 1.5.3 (Sample). If a, — 0 as n — oo and (bn)°, is bounded, then 
Anbn > Oasn > ~. 


Proof. Suppose a, — 0 as n > oo and that (bn)°°, is bounded. Then .... Thus 
Anb, > Oasn > oo. 


EXERCISE 1.5.4 Each of the following theorems is to be proved directly. State 
the first and last sentences of the proof of the theorem. 


(a) If f: A— Band g: B > C are one-to-one, then (go f) : A > C is one-to- 
one. 


(b) Ifaset A contains all its cluster points, then A is closed. 

Proof by Contrapositive 

If your task is to prove a theorem of the form p — q, then it suffices to prove any 
statement that is equivalent to p —> q. The contrapositive is the most common 


such equivalent, and a proof by contrapositve always goes something like this. 


Theorem 1.5.5 (Sample). If p, then q. 


Proof. Suppose —q. Then .... Thus —p. 


Theorem 1.5.6 (Sample). Ifx¢ AU B,thenx ¢ AUB. 


Proof. Suppose x ¢ AU B.Then.... Thus x ¢ AU B. 


EXERCISE 1.5.7 Each of the following theorems is to be proved by contrapos- 
itive. State the first and last sentences of the proof of the theorem. 


(a) If A C B, then A — Bis empty. 
(b) If ab is irrational, then either a is irrational or b is irrational. 
Proving a Logically Equivalent Statement 


If a theorem says p > q, then you may prove it by showing —q — —p because it 
is logically equivalent to the given theorem. There are many other mathematical 
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propositions that are best proved by exploiting a logically equivalent form. Here 
is acommon one. 


Theorem 1.5.8 (Sample). p— (qv 7r). 


Proof. Suppose p A 7q. Then .... Thus r. 


EXERCISE 1.5.9 Find the appropriate exercise in Section 1.2 that validates the 
proof of Theorem 1.5.8. 


EXERCISE 1.5.10 Each of the following theorems might be proved naturally by 
exploiting a logical equivalence from Section 1.2. State the first and last sentences 
of the proofs of the theorem. 

(a) (PAQ>r 

(b) If p divides ab, then p divides a or p divides b. 


(c) Ifa is rational and ab is irrational, then b is irrational. 


Proof by Contradiction 
If U represents the statement of a theorem, then U is a tautology. Thus —-U is a 
contradiction. Sometimes the most natural way to prove U is a tautology is to 


show that —U is a contradiction. You’re gonna love this. 


Theorem 1.5.11 (Sample). U. 


Proof. Suppose =U. Then... KABOOM!’ This is a contradiction. Thus U. 


EXERCISE 1.5.12 Each of the following theorems is to be proved by contradic- 
tion. State the first and last sentences of the proofs of the theorem. 


(a) p> 4 
(b) The natural numbers are unbounded in the reals. 
(c) There are infinitely many prime numbers. 


(d) 1>0 
Disproving a Statement 


Sometimes we are presented with a mathematical statement that is not true, and 
we are asked to verify this by disproving the given statement. To do this, we prove 


7 Mathematicians have been known to indulge in the occasional exclamation. 
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the negation of the statement. This is where our techniques of negation from 
Section 1.4 come in handy. 


Example 1.5.13 Each of the following statements can be disproved. By 

negating the statement, provide the statement that could be proved true. 

1. For all sets A, B,and C,if AUC = BUC,then A = B. 

2. For all sets A, B, and C, and for all functions f: A—> Bandg:B->C,if f 
and go f are one-to-one, then g is one-to-one. 


Solution 


1. There exist sets A, B,and C such that AUC = BUC and A #B. 
2. There exist sets A, B, and C and functions f : A > Band g: B > C such that 


f and go f are one-to-one, but g is not one-to-one. Hf 


Both parts of Example 1.5.13 are universal statements, so that their negations 
are existential statements. When one disproves a universal statement, we say that 
we are finding a counterexample. 


Example 1.5.14 Show that the first statement in Example 1.5.13 is false. 


Solution Let A = {1}, B = {2}, and C = {1,2}. Then AUC = BUC, but A FB. 
| 


EXERCISE 1.5.15 Suppose each of the following statements could be dis- 
Pp g 

proved. By negating the statement, provide the statement that would be proved 

true. Assume that S is a given set. 

(a) Forallx € S,x =x. 

(b) There exists a real number M such that x < M forallx € S. 

(c) If xy || x2 and x2 || x3, then x; || x3. 


(d) For every function f and every set S, f~'[f(S)] € S. 
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2.1 


Properties of Real Numbers 


It’s time to begin applying the language and logic of Chapter 1 to proof writing. In 
this chapter, you will learn to write proofs of important properties of real numbers 
that follow from assumptions A1—A22 outlined in Chapter 0. 

There are many principles of algebraic manipulation that you exploit without 
thinking. Here are some examples. 
If 2x + 3 = 13, then 2x = 10. 
If 2x = 10, then x = 5S. 
Ifa <bandb <c,thena <c. 
If |x| < 3, then —3 < x < 3. 
If ab = 0, then either a = Oorb = 0. 

Each of these properties can be proved as a theorem from assumptions 
A1-A22. Our tasks in this chapter are to understand how principles of algebraic 


manipulation follow from these assumptions and to learn to write proofs of such 
theorems. 


Basic Algebraic Properties of Real Numbers 
We begin by writing proofs of some of the most basic algebraic properties of real 
numbers. We will remind ourselves of the relevant assumptions as we need them. 


The properties of equality are of primary importance. 


(Al) Properties of equality: 


(a) Forallac Ra=a. (Reflexive) 
(b) For alla,be R,ifa=b,thenb=a. (Symmetric) 
(c) Foralla,b,cé€R,ifa=bandb=c,thena=c. (Transitive) 
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The transitive property of equality allows us a convenience that we probably 
take for granted. When we write a = b = c = d, we interpret this as meaning that 
a= d. However, we must understand what is behind such a condensed statement. 
First, it includes the statement (a = b) A (b = c), from which we conclude a = c by 
the transitive property. Second, it also includes the statement c = d, from which 
we conclude a = d, again by the transitive property. 


Properties of Addition 


There are six properties of real number addition that we take as axioms. Here they 
are again for reference. 


(A2) Addition is well defined: For all a,b,c,d¢R, if a=b and c=d, then 
a+c=b-d. 


(A3) Closure property of addition: For all a,b¢ R,a+beR. The closure 
property also holds for N, W, Z, and Q. 


(A4) Associative property of addition: For alla, b,c €R, (a+b) +c=a+(b+c). 
(A5) Commutative property of addition: For all a,b ¢ R,at+b=b-+a. 


(A6) Existence of an additive identity: There exists a number 0 € R with the 
property thata+0=aforallaeR. 


(A7) Existence of additive inverses: For all a € R, there exists some b € R such 
that a+ b= 0. Such an element b is called an additive inverse of a and is 
typically denoted —a. 


With these assumptions, we can now prove several properties of addition that 
follow from them. Read the proofs carefully, because you will mimic them to prove 
analogous properties for multiplication. 


Theorem 2.1.1 (Cancellation of addition). For all a,b,ceR, if a+c= 
b+c,thena=b. 


Proof. Pick a,b,c € R and suppose a+c=b-+c. By A7, there exists -ce R 
such that c+ (—c) = 0. Since addition is well defined (A2), we have that [a+ 
c]+ (-c) = [b+ c] + (—c), which by A4 we may write as a+[c+ (—c)] = b+ 
[c + (—c)]. This yields a + 0 = b + 0, which becomes a = b. 


We’re going to present a second proof of Theorem 2.1.1, written in a some- 
what different style. The previous proof reveals the thought process involved in 
constructing the proof, but it uses several disjoint equations. The next proof shows 
how you might clean up these equations to create one extended equation that is 
the result we want. 
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Proof 2. Suppose a, b, and c are real numbers such that a+c=b+c. By A7, 
there exists —c € R such that c+ (—c) = 0. Thus with several other properties 
from Al—A7 we have that 


aM a4 0PM at [c+ (—c)] ‘2 Grote? 
(2.1) 
(A2) 


Se OLOE KO = Pee = b40 = 2 


By A1 (the transitive property of equality), a = b. 


Because addition is commutative, Theorem 2.1.1 also allows us to say that if 
cta=c+b,thena=b.Forifc+a=c+b,then 


a+c=cta=c+b=b+c 


which by Theorem 2.1.1 implies a = b. Thus we have both left cancellation and 
right cancellation. 

We will eventually get a little lazy about referencing the real number properties 
that we need to use, especially the three properties of equality. At first, however, 
we need to make sure for our own sakes that we understand precisely which ones 
we use and when we use them. 

The next two results are uniqueness theorems. Recall what it means to say that 
a particular mathematical object is unique: If x; and x2 are both assumed to have 
the desired property, then x1 = x2. 


Theorem 2.1.2 The additive identity is unique. 


Proof. Suppose 0; and 02 are both additive identities. Then 


0; = 0; + 02 = 02 


Assumption A7 guarantees that every real number a has an additive inverse 
—a for which a + —a = 0. This is an axiom. But nothing about the axioms Al—A22 
precludes (at first glance) that a real number might have more than one additive 
inverse. The next theorem shows that every real number has a unique additive 
inverse. Once we prove this, we will then be able to talk about the additive inverse 
of a. Understand what the uniqueness component of the existence of additive 
inverses would say. For all b;, b2, if b; and b2 are both additive inverses of a, then 
by = by. 


Theorem 2.1.3 The additive inverse of a real number is unique. 


Proof. Pick a € R, and suppose by, b2 € R are both additive inverses of a. Then 
a+b; = 0 and a+ b2 = 0. Thus by the transitive property of equality, a+b, = 
a+b, so that b) = b2 by Theorem 2.1.1. Therefore the additive inverse of a is 
unique. 
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Theorem 2.1.4 For every a € R, —(—a) =a. 


Proof. Pick a € R. Then —a exists in R, where a + (—a) = 0. But then —(—a) also 
exists in R, where (—a) + [—(—a)] = 0. Thus (—a) + [—(—a)] = a+ (—a), which 
by cancellation yields —(—a) = a. 


Theorem 2.1.5 For alla,b € R, —(a+b) = (—a) + (-D). 


Proof. Pick a, b € R. Then —a and —b both exist in R. By A3, (—a) + (—D) is also 
a real number. Exploiting associativity and commutativity of addition, we have 


(a+ b) + [(—a) + (-b)] = [a + (—a)] + [b+ (—b)] =0+0=0 (2.2) 
Thus —(a+ b) = (—a) + (—D). 


One of the oldest gags around says that, if it quacks like a duck, it must be a 
duck. That is, the defining characteristic of ducks is that they quack. Now suppose 
you know that there is only one duck in the world, and you have just found an 
animal that quacks. Then, by golly, you have found the duck. 

In Theorem 2.1.5, the duck we are looking for is —(a+ b). That is, we are 
looking for some real number, which, when added to (a+ b), yields zero. After 
all, that is the quacking feature of —(a + b). Theorem 2.1.5 claims that the number 
(—a) + (—b) quacks, and Eq. (2.2) is the demonstration of that claim. Finally, since 
the additive inverse of a real number is unique, we know that this quacking thing 
we have found is indeed —(a + b). 

Theorem 2.1.5 gives us the right to make a statement like the following: 


The additive inverse of a sum is the sum of the additive inverses. 


It is quite common in mathematics to investigate the truth of a statement of the 
form 


The X of the Y is equal to the Y of the X. (2.3) 


Statements of this form can be very powerful if true. However, sometimes such 


a statement is false. For example, let X = “square root” and let Y = “sum” and 
you have the statement /a+ b= /a+ Vb. 


EXERCISE 2.1.6 Calling on your experience from previous math courses, con- 
struct several statements with the same form as (2.3), some of which are true and 
some false. 


The behavior of zero yields the next theorem. 


Theorem 2.1.7 —0=0. 


Proof. Since 0 is the additive identity, 0+ (—0)=-—0. Also, since —0 is 
the additive inverse of 0, by definition it satisfies 0 + (—0) = 0. Thus we have 
—0=0+4 (-0) = 0. 
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2.1.2 Properties of Multiplication 


Now it’s time for you to write your first proofs. We begin with some theorems 
that are exactly analogous to Theorems 2.1.1—2.1.7, except that they are about 
multiplication instead of addition. We have to be a bit more careful with multi- 
plication because multiplicative inverses are assumed to exist only for nonzero 
real numbers. Thus there are a few more results to prove for multiplication than 
there were for addition. First, we recall the relevant assumptions from Chapter 0. 
Note that we include the distributive property here (A14), as well as that very odd 
assumption that 1 4 0 (A15). 


(A8) Multiplication is well defined: For all a,b,c,d¢R, if a=b and c=d, 
then ac = bd. 


(A9) Closure property of multiplication: For all a, b € R,a-b € R. The closure 
property of multiplication also holds for N, W, Z, and Q. 


(A10) Associative property of multiplication: For all a, b,c € R, (ab)c = a(bc). 


(A11) Commutative property of multiplication: For all a, b € R, ab = ba. 


(A12) Existence of a multiplicative identity: There exists a number 1 € R with 
the property that a- 1 =aforallae R. 


(A13) Existence of multiplicative inverses: For all nonzero a € R, there exists 
some b € R such that ab = 1. Such an element b is called a multiplicative 
inverse of a and is typically denoted a~!. 


(A14) Distributive property of multiplication over addition: For all a,b,c € 
R, a(b+c) = ab +ac. 


(A15) 140. 
EXERCISE 2.1.8 Foralla,b,c € R,ifac = be andc £0, thena = b. 
EXERCISE 2.1.9 The multiplicative identity is unique. 


EXERCISE 2.1.10 The multiplicative inverse of a nonzero real number is 
unique. 


Cancellation of addition comes in handy in the next theorem. 


Theorem 2.1.11 For every real number a, a-0 = 0. 


Proof. Pick a € R. By A6é and A14, we have that 
0+a-0=a-0=a-(0+0)=a-0+a-0 (2.4) 


By Theorem 2.1.1, we may cancel a-0 from both sides of Eq. (2.4) to have 
a-0=0. 
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Occasionally, a theorem and its proof yield a special result that follows imme- 
diately but is important in its own right. Such a result is called a corollary. Some- 
times the truth of the corollary is so transparent that it does not require its own 
proof. Sometimes, however, a word of explanation is in order. 


Corollary 2.1.12 Foralla € R,ifa 4 0,thena! £0. 


Proof. Pick any nonzero a €R. Since a-0 =0 and 0 41, then a-0 41. Thus 
-1 
a # 0. 


EXERCISE 2.1.13 Foralla 40, (a-!)-! =a. 

EXERCISE 2.1.14 For all nonzero a,b ER, (ab)! =a'b 1! 
EXERCISE 2.1.15 1-1 =1. 

EXERCISE 2.1.16 Forallae€R, (—l)a= —a2 


EXERCISE 2.1.17. For all a, b € R, the following hold: 
(a) (—a)b = —(ab)? 
(b) (—a)(—b) = ab*? 


Suppose we have chosen some nonzero real number a. First take its additive 
inverse —a, and then take the multiplicative inverse of that to have (—a)~!. Now 
start with a again, and perform the same two processes in reverse order. First take 
the multiplicative inverse of a to have a~!; then take the additive inverse of that to 
have —(a~!). You probably expect that these two processes, when done in reverse 
order, produce the same result, but if so, it must be demonstrated. Look closely at 
the proof of Theorem 2.1.18, for this kind of stunt can come in handy sometimes. 


Theorem 2.1.18 For all nonzero real numbers a, (—a)~! = —(a7!). 


Proof. Suppose a # 0. Then there exists a~! € R, and by Exercise 2.1.17(b), 
l=a-a'=(-a)[-(a')] (2.5) 


By Eq. (2.5), we see that the multiplicative inverse of —a is —(a~!). That is, 
(-a) "= —(@"!). 


! Think about ducks. 

2 Ducks. 

3 You don’t have to think about ducks. Just use previous results to transform the left-hand side into the 
right-hand side in several steps. 

4 Do you find yourself in need of the statement (—1)(—1) = 1? How do you know this is true? Look at 
the next hint if you have to. 

5 It’s true because (—1)(—1) = —(-1) = 1 (Exercise 2.1.16 and Theorem 2.1.4). 
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EXERCISE 2.1.19 [Principle of zero products] If ab = 0, then either a = 0 or 
b= Oe 


EXERCISE 2.1.20 Foralla,b,c,d €R,(at+b)(c +d) =ac+ad + be + bd. 


EXERCISE 2.1.21 Suppose we replace assumption A15 with the assumption 
that 1 = 0. Show that, with this assumption, there are no nonzero real numbers. 


EXERCISE 2.1.22 In this section two results in addition to Theorem 2.1.5 can 
be worded as “The X of the Y is equal to the Y of the X.” Find them, and state them 
in this form. 


Ordering Properties of the Real Numbers 
In Chapter 0, we stated three assumptions about how the real numbers are 
ordered. 
(A16) Trichotomy law: For any a € R, exactly one of the following is true: 
(a) a > 0,in which case we say a is positive 
(b) a=0 
(c) 0 > a,in which case we say a is negative 
(A17) For alla,b € R,ifa > Oandb > 0,thena+b>0. 
(A18) For all a,b € R,ifa > 0andb > 0, then ab > 0. 


We address the sign of real numbers by assuming that every nonzero real 
number can, by the trichotomy law, be compared to zero in one way or another 
by the symbol >. If a > 0, we say that a is positive, and if 0 > a (or a < 0), we say 
that a is negative. Notationally, we write the positive and negative real numbers 
as R* and R-, respectively. We have therefore split the real numbers into three 
pieces, R™, {0}, and R-, and we have two closure assumptions about R*. 

We can now assign meaning to the statement a > b, as we did in Definition 0.2.1. 
We restate that definition here for reference. Recall from Chapter 0 that a — b = 
a+ (—b) by definition. 


Definition 2.2.1 Given real numbers a and b, we say that a > b provided 
a—b->0. The statement a < b means b > a. The statement a > b means that 
either a > b or a= b. Similarly, a < b means either a < bora=b. 


So anytime we see a statement a > b, it means precisely that a — b > 0. Sim- 
ilarly, if someone asks you to demonstrate that a > b, we can do it by showing 
a—b > 0. Definition 2.2.1, with Theorem 2.1.4, allows us to prove the following. 


® See Exercise 1.2.18(h). 
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Theorem 2.2.2 Ifa> b,then —a < —b. 


Proof. Suppose a > b. Then a— b > 0. But a = —(—a), so that —(—a) —b > 0, 
which we may write as —(—a) + (—b) > 0. By commutativity, (—b) + [—(—a)] > 0, 
or —b — (—a) > 0,so that —b > —a. Hence —a < —b. 


Corollary 2.2.3 Suppose c is any real number. Then c > Oif and only if —c < 0. 
Proof. (=) Suppose c > 0. Then we may let a = c and b = 0 in Theorem 2.2.2 
to have —c < -0=0. 


(<=) Suppose —c < 0. Then we may let a = 0 and b = —c in Theorem 2.2.2 to 
have —0 < —(—c),orc > 0. 


Assumption A15 states that 1 4 0, which might seem silly, but actually is an 
essential assumption. In Exercise 2.1.21, you showed that the assumption 1 = 0 
causes the entire set of real numbers to collapse to the single element set {0}. 
Since 1 4 0 by assumption, the trichotomy law implies that either 1 > 0 or 1 < 0. 
To tackle the next exercise, try a proof by contradiction. 


EXERCISE 2.2.4 1>0. 


EXERCISE 2.2.5 Prove the following. 


(a) Ifa<b,thena+c<b+ec. 

(b) Ifa < bandc=d,thena+c<b+d. 
(c) Ifa<bandc <d,thena+c<b+d. 
(d) Ifa<bandc <d,thena+c<b+d.! 
(e) Ifa <bandb <c,thena <c. 

(f) Ifa>bandb>c,thena>c. 

(g) Ifa <Oandb <0,thena+b <0. 
(h) Ifa > Oandb < 0, then ab < 0. 

(i) Ifa <Oandb < 0, then ab > 0. 

(j) Ifa < bandc > 0,thenac < be. 

(k) Ifa < bandc <0, then ac > be. 


EXERCISE 2.2.6 Fora real number a, write a2 = a-a. Prove the following. 


(a) If0 <a <b,thena* < Db’. 
(b) Ifa <b <0,thena* > b’. 


7 Case it out. 
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(c) For all real numbers a, a >0. 


2 


(d) The equation x“ = —1 has no real number solution x. 


EXERCISE 2.2.7. Prove the following. 


(a) 0~! does not exist in R. 
(b) Ifa>0,thena™! > 0. 
(c) Ifa <0,thena! <0. 
(d) c > 1if and only if0 <c7! <18 


(e) Ifa>O0andc > 1,thena/c <a. 


+1 10 


(f) Ifaand bare integers such that ab = 1, thena=b=4 
EXERCISE 2.2.8 Ifaandb are nonzero real numbers anda < b, does it follow 
that 1/a > 1/b? Use results from Exercises 2.2.5 and 2.2.7 to state and prove the 


relationship between 1/a and 1/b depending on the signs of a and b. 


EXERCISE 2.2.9 Prove thatifa < b,thena < (a+ b)/2 < b. (How do you know 
that 2 > 0? What is 2 anyway?) 


2.3 Absolute Value 


You are certainly familiar with the absolute value of real numbers: 


Definition 2.3.1 For a real number x, we define |x|, the absolute value of x, by 


x, ifx>0 
|x| = 0, ifx=0 (2.6) 
—x, ifx <0 


The fact that |x| is defined in three cases means that proofs of theorems 
often have to address the cases separately. In many situations, two cases suffice, 
by noting that the following is equivalent to Definition 2.3.1. 


x, ifx>0 (2.7) 


|x| = ; 
—x, ifx<0O 


8 To show 0 < c7! < 1, you must show the two inequalities 0 < c7! and c7! < 1 separately. 

9 Recall a/c = acT}. 

10 There are five cases: a = —l,a=0,a=1,a>1, anda < —1. Three are impossible. Remember that 
1 is the smallest positive integer. 
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Absolute value is a very important measure of the size of a real number. We 
can make two observations right off the bat. 


(N1) For every real number x, |x| > 0. 
(N2) |x| = 0if and only if x = 0. 


First let’s look at property N1. Pick any real number x. If x > 0, then |x| = 
x > 0. Ifx < 0, then |x| = —x > 0 by Corollary 2.2.3. Thus in any case, |x| > 0. 

Property N2 follows directly from Corollary 2.2.3 and the trichotomy law. The 
reason we point these out is that properties N1—N2 are two of the three defining 
properties of a norm, a very important term in analysis. Given a set (perhaps of 
numbers, functions, sets) a norm is a measure of the size of its elements. There is a 
third property of a norm that absolute value has, and we will see it in Exercise 2.3.8. 

First let’s explore some of the simplest and most familiar behaviors of absolute 
value. We will prove a few, either wholly or in part, and leave some to you as 
exercises. The first one is really easy, so it’s all yours. You'll probably want to use 
the three cases in Eq. (2.6). 


EXERCISE 2.3.2. For all real numbers x, |—x| = |x|. 


Theorem 2.3.3 Let a> 0 be a given real number. Then |x| = a if and only if 
X= md. 


Proof. Let a > 0 be given. 


(=) Suppose |x| = a. If x > 0, then x = |x| = a. If x < 0, then x = — |x| = —a. In 
either case, x = ka. 


(<=) Suppose x = +a. First note that if a = 0, then |x| = x = a. So suppose a > 0. 
For the case x = a, we have that x > 0, so that |x| = x =a. For the case 
x = —a, we have that x < 0. Thus |x| = —x = a. In all these cases, |x| = a. 


The next result is an if-and-only-if theorem, where each direction can be 
handled by considering the two cases x > 0 and x < 0. 


Theorem 2.3.4 Let a > 0 be a given real number. Then |x| < a if and only if 
—a<xX<a. 


Proof. Let a > 0 be given. 
(=) Suppose |x| < a. If x > 0, then 
—a<O<|x|=x<a (2.8) 


On the other hand, if x < 0, we may write — |x| > —a to have 


a<—|x)|=—-—(-x)=x<O0<a (2.9) 


In either case, we have —a < x < a. 


11 
12 


13 
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(<) Now suppose —a < x < a. If x > 0, then |x| = x < a. If x < 0, we note that 
—a <x is equivalent to —x < a, and we have |x| = —x < a. In either case, 
|x| < a. 


We may apply Exercise 1.2.18(j) to derive a corollary to Theorems 2.3.3 
and 2.3.4. Define the following statements: 


p: |xi=a q:x=H#a r: |x|}<a s: -a<x<a (2.10) 


By Exercise 1.2.18(j), (p = q) A (r © 5) is stronger than (p V r) = (q Vs). Wecan 
therefore combine Theorems 2.3.3 and 2.3.4 to have the following. 


Corollary 2.3.5 Let a> 0 be a given real number. Then |x| < a if and only if 
-—a<x<a. 


You can prove the next result by casing it out, if you like. Or you can apply 
Corollary 2.3.5 and some sneaky logic to prove it quickly. 


EXERCISE 2.3.6 Leta>Obea given real number. Then |x| > a if and only if 
either x > aorx < —a. 


EXERCISE 2.3.7 For all real numbers x, — |x| < x < |x|. 
Now let’s state the third property of a norm, and show that |x| has this impor- 


tant property. Exercise 2.2.5(d), Corollary 2.3.5, and Exercise 2.3.7 will provide 
just the right mathematical machinery to prove the following. 


EXERCISE 2.3.8 [N3: Triangle Inequality] For all real numbers x and y, 
le + yl <del + yl (2.11) 


Another triangle-type inequality can be proved in two lines from Exercise 2.3.8, 
if you can see how to apply it creatively. 


EXERCISE 2.3.9 For all real numbers x and y, |x — y| = [x] — ly|.12-25 
There is one more triangle-type inequality that is important in analysis. The 


proof requires some sneaky application of several of the results we have shown 
so far. 


Find M such that —M < x+ y < M and apply Corollary 2.3.5. 

One of the mathematician’s most useful tricks is knowing when and how to add zero. Start with 
Exercise 2.3.8, and reassign the roles of x and y. Don’t look at the next hint unless you have to. 

Start with |x| by itself and add zero inside the absolute value. 
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EXERCISE 2.3.10 For all real numbers x and y, 


|x| — |yl| < lx—yI."4 


Exercises 2.3.8—2.3.10 reveal how addition and subtraction relate to absolute 
value. The final exercise for this section reveals how multiplication and division 
relate to absolute value. 


EXERCISE 2.3.11 Suppose x and y are real numbers. Then the following are 
true. 


(a) |xyl = |x| lyI- 
(b) If y 4 0, then |y~1| = |y|7?.9 


(c) If y £0, then |x/y| = |x|/Iyl. 


The Division Algorithm 


This section is devoted to one very important theorem about the integers. Even 
though it is called the division algorithm, it is not an algorithm in the sense of 
computer science, where we think of an algorithm as a step-by-step process for 
performing a task. Instead, the division algorithm is merely a unique existence 
theorem, but a very useful one. 

One way to assign meaning to the words even integer and odd integer is the 
following. 


Definition 2.4.1 Ifn is an integer, we say that n is even provided there exists an 
integer k such that n = 2k. We say that n is odd provided there exists an integer k 
such that n = 2k + 1. 


Let’s prove some easy results about even and odd integers. Notice how the 
proof of the first one appeals to the definition of even integer in both directions. 
First, when we are given that a particular integer n is even, then we can claim the 
existence of some integer k; such thatn = 2k. Then, when we need to demonstrate 
that some integer p is even, we must be able to find some integer kz such that 
Dp = 2k. 

We present two slightly different proofs of the next theorem to illustrate subtly 
different tastes for the flow of a proof. 


Theorem 2.4.2 Let m andn be integers, at least one of which is even. Then mn 
is even. 


Proof 1. Pick integers m and n, and suppose (without loss of generality) that 
n is even. Then there exists integer ky such that n = 2k,. Thus we have that 


Apply Corollary 2.3.5 by writing the expression in Exercise 2.3.9 in two ways, once switching the roles 
of x and y. 
Use a technique like the proof of Theorem 2.1.18. 
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mn = m(2k1) = 2(mk1), so that we may let k2 = mk; (which is an integer) to have 
mn = 2k. Thus mn is even. 


Proof 2. Pick integers m and n, and suppose (without loss of generality) that n 
is even. Then there exists integer k; such that n = 2k,. Let ky = mk,, which by 
closure is also an integer. Then 


mn = m(2k1) = 2(mk1) = 2k (2.12) 


Thus mn is even. 


Corollary 2.4.3. Ifn is an even integer, then n7 is even. 


Proof. Let m = n in Theorem 2.4.2. 


EXERCISE 2.4.4 If mandn are odd integers, then mn is odd. 


Corollary 2.4.5 Ifn is an odd integer, then n” is odd. 
EXERCISE 2.4.6 Prove the following. 


(a) The sum of two even integers is even. 
(b) The sum of two odd integers is even. 


(c) The sum of an even integer and an odd integer is odd. 


At this point, you’re likely thinking that Definition 2.4.1 addresses the only two 
possible situations that can happen in the integers. After all, isn’t every integer 
either even or odd, and not both? Well, maybe, but how do you know that? How 
do you know that every integer can be written either in the form 2k or 2k + 1, but 
not both? This, and a whole lot more, is addressed by the division algorithm. 

Ever since elementary school, you have been familiar with the idea of dividing 
an integer b by another integer a > 0 to produce a quotient q and remainder r. One 
way you could express the results of your division calculation in an equation is to 
write 


b=aq+r (2.13) 


where 0 <r <a. For example, if a = 12, b = 88, we can write 88 = (12)(7) + 4, 
and if a = 6, b = —13, we have —13 = (6)(—3) + 5. One nice thing about the form 
of Eq. (2.13) is that every number involved is an integer. What resembles division 
of integer by integer is written as an integer equation without having to resort to 
rational numbers. The division algorithm says that the existence of a quotient g and 
a remainder 0 < r < a are guaranteed. Even more, they are unique. The theorem 
is a surprisingly useful one, as you will come to see. In its proof we need an 
assumption from Chapter 0 that we have not used yet: the well-ordering principle 
(WOP). 
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(A19) Well-ordering principle: Any non-empty subset of whole numbers has a 
smallest element. That is, if A is non-empty, then there is some number 
a € A with the property that a < x for all x € A. 


Theorem 2.4.7 (Division Algorithm). Let a and b be integers, where a > 0. 
Then there exist unique integers g and r such that b = aq+rand0<r<a. 


We provide the existence part of the proof here. Then you will show uniqueness 
as an exercise. In showing existence for the case b > 0, we use the set 


S={b-—aq:qé€Z, b—aq>=0} (2.14) 


which is merely the set of all whole numbers you can generate by subtracting 
integer multiples of a from b. For example, if a = 12 and b = 105, then 


S = {...,129, 117, 105, 93, 81, 69, 57, 45, 33, 21, 9} (2.15) 


The last element of S is 105 — 12q, where q = 8. Notice that it is the smallest 
element of S and is strictly less than 12. By constructing the corresponding S 
according to Eq. (2.14) for an arbitrary a > 0 and b > 0, we can apply the WOP 
to S, then show that this smallest element is the value of r we’re after. Here is the 
proof of the existence part of Theorem 2.4.7, in as much detail as we promised. 


Proof. (Existence) Let a and b be integers, where a > 0. First, we consider the 
case b > 0. Define the set S as in Eq. (2.14). By definition, S C W, and since b > 0, 
we may let g = 0 to see that b € S, so that S is non-empty. By the WOP, S has a 
smallest element, which we may denote r, and note that r is of the form b — aq 
for some integer q. Thus we have that b = aq +r, where r > 0, and we must show 
that r < a. To showr < a,suppose r > a (from which we will arrive at a contradic- 
tion). Then b — a(q+1) =b-—aq-—a=r—a>0,sothat b—a(q+1) € S. Also, 
b—a(q+1) =b-—aq-—a < b—aq=r. Thus b — a(q +1) is an element of S that 
is smaller than r. This is a contradiction, because r was chosen to be the smallest 
element of S. Thus it must be that r < a. 

Now suppose b < 0, so that —b > 0. Applying the previous case to —b, there 
exist integers g; and r; such that (—b) = aq, +r; and0 <r, < a. Nowif r; = 0, 
then —b = aqj, so that b = a(—q1). Thus we may let g = —q; and have that b = 
aq + 0. So suppose that 0 <r; < a. Then b = a(—q) — ry = a(q1 + 1) — (2-1). 
Letg = qi + landr = a—1r,.Thengandrare integers, and the fact thatO < 11 <a 
implies that —a < —r; < 0,so that0 <a—1r <a.Thus0<r<a. 


EXERCISE 2.4.8 Show uniqueness of g and r in Theorem 2.4.7.!° 


Suppose q1, 92,171,172 € Zare such that b = aq, + ry (0 < ry < a)andb = aq + r2 (0 < rp < a). There 
are two useful things you can say about ry — r1. 
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EXERCISE 2.4.9 Appeal to the division algorithm to explain precisely why 
every integer is either even or odd, but not both. 


Divisibility and Prime Numbers 


Suppose a and b are nonzero integers and we apply the division algorithm to write 
b = aq +r. The situation where r = 0 motivates a new term. 


Definition 2.5.1 Suppose a and b are nonzero integers. We say that a divides b, 
written a|b, provided there exists an integer k such that b = ak. We also say that 
ais a divisor of b. If a| b where a ¢ {+1, +b}, we call a a proper divisor of b. If a 
does not divide b, we write a { b. 


Notice that if a does not divide b, then writing b = aq +r according to the 
division algorithm will imply 0 < r < a. First, here are some really easy results 
about divisibility for you to prove. 


EXERCISE 2.5.2 Ifaisanonzero integer, then a| a. 


EXERCISE 2.5.3 If a,b, and c are nonzero integers such that a|b and b|c, 
then a|c. 


An expression of the form mb + nc is called a linear combination of b and c. 
The next exercise says that if a divides both b and c, then it divides any integer 
linear combination of them. 


EXERCISE 2.5.4 If a, b,and c are nonzero integers such that a| b and a|c, then 
a| (mb + nc) for all integers m and n. 


EXERCISE 2.5.5  Ifaand b are nonzero integers such that a|b and b|a, then 
Eb. 


a=a 


Some important and useful results in algebra stem from what we call the great- 
est common divisor (gcd) of two nonzero integers a and b. First we must define 
what we mean by the gcd of two nonzero integers. We do this by stating two cri- 
teria that a gcd must satisfy, one motivated by the word common, and the other 
motivated by the word greatest. Then we show that such a thing exists uniquely 
and can be written in a somewhat surprising way. So that we will have uniqueness 
of gcd(a, b), we insist that any integer that hopes to qualify as gcd must be positive. 


Definition 2.5.6 Suppose a and b are nonzero integers, and suppose g is a 
positive integer with the following properties: 


17 See Exercise 2.2.7(f). 
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(D1) g|aand g|b 
(D2) If is any positive integer such that h|a and h |b, then h | g also. 


Then g is called a greatest common divisor of a and b, and is denoted gcd(a, b). 


Some remarks about Definition 2.5.6 are in order. First, nothing about the 
definition of gcd(a, b) (or of any term for that matter) guarantees that any such 
thing exists. It merely lays out criteria by which some positive integer earns the 
right to be called gcd(a, b). Second, property D1 states that any integer that might 
qualify as gcd(a, b) will in fact be a common divisor of a and b. Third, we need 
to explain our choice of property D2 as the other criterion, for it might seem a 
bit unnatural. If property D2 is supposed to describe what it means for g to be 
greatest of all the common divisors of a and b, you might think a more natural way 
to say it would be 


(D2’) Ifh is any positive integer such that h|a andh|b,thenh < g. 


Well, we could do it that way, but we don’t. Here’s why. It all centers around 
how you decide to measure greatness. In the integers, we can measure rela- 
tive greatness by <. But as you will see in your later work in algebra, there 
are other more abstract settings where we discuss divisibility, then ask about a 
gcd, but we do not necessarily have a notion of < to use as our criterion for 
greatness. The point is that it is better to develop a criterion for greatness of a 
divisor that stays within the context of divisibility rather than relying on some 
external measure like <, which might or might not already exist. All this is to 
say that property D2 is our measure of greatness among all common divisors 
of a and b. If g is going to qualify as gcd(a, b), it has the property that any 
positive integer h that comes down the pike also having property D1 cannot, 
in some sense, be greater than g. Our way of saying / is no greater than g is 
that h| g. 

In the past, you have probably found ged(a, b) by breaking a and b down into 
their prime factorizations to see how many 2s, 3s, 5s, and so on, that you can factor 
out of each. Then you built the gcd to contain just the right number of each prime 
factor represented. This might work well practically, but (1) we have no logical 
basis for this in the work we have done so far because we have not discussed prime 
numbers yet, and (2) it is not particularly useful as leverage in later theorems. 

So, given nonzero integers a and b, how do we even know that there exists 
some positive integer having properties D1-D2? And if such an integer does 
exist, how many can there be? The following theorem claims that gcd(a, b) exists 
uniquely. Furthermore, hidden inside its proof is some additional information 
about gcd(a, b) that comes in handy later. Most of the verification of the details is 
left to you as an exercise. 


Theorem 2.5.7 Suppose a and b are nonzero integers. Then there exists a 
unique positive integer g with the following properties: 
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(D1) g|aand g|b. 
(D2) Ifh is any positive integer such that h| a and h |b, then h | g also. 


Proof. Let a and b be nonzero integers, and define 
S={ma+nb:m,ne€Z,ma+nb > 0} (2.16) 


That is, S is the set of all positive integer linear combinations of a and b. First, S is 
not empty, for depending on the signs of a and b, we may let m, n = +1 to produce 
some positive value of ma +nb. By the WOP, S contains a smallest element g, 
which may be written in the form g = moa + nob for some integers mo and no. By 
Exercise 2.5.8, g has properties D1—D2, and if g; and g2 are both positive integers 
that have properties D1-D2, then g1 = go. 


EXERCISE 2.5.8 Let g = moa+nob be the smallest element of S as defined in 
Eq. (2.16). Show the following to complete the proof of Theorem 2.5.7. 


(a) g|a. (The proof that g | b is identical.)!8-19 


(b) If is any positive integer with the properties that h | a and h | b, then it 


must be that h | g. 
(c) If g, and g» are positive integers with properties D1-D2, then g; = g2.”° 


The proof of Theorem 2.5.7 yields a serendipity about gcd(a, b), in that it can 
be written as an integer linear combination of a and b, and the smallest such pos- 
itive expression is in fact the gcd. This can be particularly helpful if gcd(a, b) = 1, 
in which case we say that a and b are relatively prime. Immediately, we can see 
that if a and b are relatively prime, then there exist integers m and n such that 
ma+nb = 1. Furthermore, if it is possible to find an integer linear combination 
of a and b that equals 1, then clearly this linear combination is the smallest such 
positive linear combination. Thus a and b are relatively prime. 


EXERCISE 2.5.9 Show that 14 and 33 are relatively prime by determining 
integers m and n such that 14m + 33n = 1. 


Now we are ready to define prime numbers and investigate a few of their 
important properties. A positive integer p is defined to be prime if it has precisely 
two distinct positive integer divisors. If this is true, then these divisors must be 
1 and p. Note that this definition excludes 1 from being prime. 


Theorem 2.5.10 Suppose a is a nonzero integer and p is prime. Then either a 
and p are relatively prime or p|a. 


18 Apply the division algorithm to a and g and show that r > 0 is impossible. 
19 To suppose r > 0 leads to a contradiction because it produces an element of S that is smaller than g. 
20 Theorem 2.5.5 should come in handy. 
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Proof. Let a be a nonzero integer, and let p be prime. Since the only positive 
integer divisors of p are 1 and p, then either gcd(a, p) = 1 or gcd(a, p) = p. In the 
former case, a and p are relatively prime. In the latter case, p satisfies property 
D1, so that p|a. 


EXERCISE 2.5.11 Suppose a and b are nonzero integers, p is prime, and p|ab. 
Then either p|a or p|b.?! 


As an immediate corollary, by letting a= b in Exercise 2.5.11, we have the 
following. 


Corollary 2.5.12 If p|a*,then p|a. 


EXERCISE 2.5.13. Let a1, 42,..., a, be nonzero integers. 


(a) Suppose a is an integer such that a| aj; for all 1 < j <n. Show that a 
divides any integer linear combination of a, ..., dn. 

(b) Motivated by Definition 2.5.6 create a definition of gcd(a1, az, ..., Gn). 

(c) State and prove a parallel to Theorem 2.5.7. 


EXERCISE 2.5.14 Ifn > 3 isan odd integer, then 8 | (n? — 1). 


EXERCISE 2.5.15 If 7 is an integer such that n >2 and 3|(n—1), then 
3|(n2 — 1). 


21 See Exercise 1.2.18(h). 
22 Tf k is an integer, then either k or k +1 is even. 
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All of the results in Chapter 2 were about familiar sets of numbers, in particular 
their algebraic properties arising from the binary operations of addition and mul- 
tiplication. This chapter will also address properties of real numbers, but within a 
broadened context of their familiar subsets (N, W, Z, and Q) and sets in general. 


3.1 Set Terminology 


First, we return to the set terms from Chapter 0 and introduce several others. 
Given a set A in the context of a universal set U, we often illustrate with a Venn 
diagram, as in Figure 3.1. Shading a Venn diagram can sometimes be helpful to 
illustrate sets under discussion. 


Definition 3.1.1 Given aset A, we define the complement of A, denoted AC, in 
the following way. 


AC ={x:xeUandx ¢ A} (3.1) 
Thus x € AC if and only if x ¢ A. See Figure 3.2. 


Given two sets A and B, a general Venn diagram would look like that in 
Figure 3.3. We can now define several important binary operations to create new 
sets from A and B. 


Figure 3.1 Basic Venn diagram. 
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Figure 3.2 Venn diagram illustrating AC. 


( 


Figure 3.3 Generic Venn diagram with two sets, A and B. 


¢ 


Figure 3.4 Venn diagram illustrating AU B. 


Definition 3.1.2 Given two sets A and B, we define their union AU B in the 
following way. 


AUB={x:xeAorxe B} 
Thus x € AU Bifand only if x € A or x € B. See Figure 3.4. 


Definition 3.1.3 Given two sets A and B, we define their intersection AN Bin 
the following way. 


ANB={x:xeAandx ce B} 
Thus x € AN Bifand only if x € A and x € B. See Figure 3.5. 


Definition 3.1.4 Given two sets A and B, we define their difference A — B in 
the following way. 


A-—B=ANB° ={x:xeAandx ¢ B} 
Thus x € A — Bif and only if x € A and x ¢ B. See Figure 3.6. 
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Cc 


Figure 3.5 Venn diagram illustrating AN B. 


Figure 3.6 Venn diagram illustrating A — B. 


Figure 3.7 Venn diagram illustrating AA B. 


Cc 


Cc 


Definition 3.1.5 Given two sets A and B, we define the symmetric difference 
of A and B in the following way. 


AAB = (AN B®) U (AON B) = {x:xE€ AN B® orx € AT NB} (3.2) 


Thus x € AAB if and only if x € A — B or x € B— A. See Figure 3.7. Note that 
x € AAB if x is in precisely one of sets A and B. 


EXERCISE 3.1.6 Construct the negation of the following statements. 


(a) xe AC 
(b) xe AUB 
(c) xe ANB 
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(d) xe A-B 
(e) xe AAB 


Now we define some terms of comparison for sets. 


Definition 3.1.7 Suppose A and B are sets. We say that A is asubset of B, written 
A C B, provided that for all x, if x € A, then x € B. That is, 


(ACB) S (Wx)\(xe€ A> xe B)S (Wx € A)(xe€ B) (3.3) 


EXERCISE 3.1.8 Inthe spirit of Definition 3.1.7, construct a definition of A Cc B 
from its definition on p. 2. 


The relationship A C B can be displayed in the Venn diagram in Figure 3.8. 
This general diagram is not intended to imply that the region outside A but inside 
B contains any elements. 


Definition 3.1.9 If A and B are sets, we say that A = B provided A C B and 
BCA. That is, 


A=B#(ACB)A(BCA) i 
& (Wx)(x€ A> xe BYA (Vx)(x € B> xX € A) a 


EXERCISE 3.1.10 Construct a negation of the following statements. 


(a) ACB 

(b) ACB 

(c) A=B 

(d) AC BUC 
(e) ANBCCUD 


Figure 3.8 Venn diagram with A C B. 
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(f) For all sets A, B,and C,if AUC C BUC,then A C B. 
(g) For all sets A and B,if A C B,then AN B= A. 


Definition 3.1.11 Giventwosets A and B, we say that they are disjoint provided 
AN Bis empty. 


EXERCISE 3.1.12 If A and Bare not disjoint, then there exists an element in 
AN B. By negating this, construct an if-then statement that is equivalent to the 
statement that A and B are disjoint. 


EXERCISE 3.1.13  Inthisexercise, A and B represent sets, and x is an unspecified 
element. In each part of this exercise, you are presented with two statements, one 
of which is stronger than the other. State which one is stronger. 


(a) xEA xE€AUB 
(b) xEA xE€ANB 
(c) xE€A xEA-—B 
(d) xe AAB xEA-—B 
(e) ACB A=B 
(f) ACB ACB 


Proving Basic Set Properties 


Now we turn to proofs of basic set properties. At this point we fulfill the promise we 
made in Section 1.5 to provide an example of a theorem proved with a tautology. 
A theorem about set equality is a natural place to do this. We will set up the logical 
statements, then leave the truth table to you as an exercise. Then we will write a 
proof in paragraph form. Here is the theorem we will prove in these ways. Note 
that it is a universal statement about all sets A and B that satisfy A C B. We omit 
the universal quantifiers. 


Theorem 3.2.1 If AC B,thenAN B= A. 


In this set theorem, a Venn diagram can convince you that the theorem is 
true. If the Venn diagram is drawn as in Figure 3.8, then the fact that AN B= A 
is apparent. But Venn diagrams do not constitute a proof at this point in your 
mathematical career. You have a long way to go before you earn the right to say 
“Proof by picture.” 

Define the following logical phrases. 


p:xeéeA (3.5) 
q: xéEB (3.6) 
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How do we state the hypothesis of Theorem 3.2.1 in terms of these statements 
p and q?! Also, how do we state the conclusion AM B = A in the same terms?” 
The answers: 


hypothesis conclusion 
a —__——. mm 
(P>Q>(PAQ > PIAL > (PADI (3.7) 


EXERCISE 3.2.2 Construct a truth table for (3.7) to verify it is a tautology. 


EXERCISE 3.2.3 Using p and q as defined in (3.5) and (3.6), translate the 
following theorem into symbolic form and prove it with a truth table. 


If AUB=B,thenA CB. 


Now we present a proof of Theorem 3.2.1 in mathematical prose. To prove 
two sets are equal, we must show that each is a subset of the other. Furthermore, 
to show that one set is a subset of another, we must show that every element of 
the former is also an element of the latter. Proofs of this sort are often called 
element-chasing proofs. They show two sets are equal by chasing an arbitrarily 
chosen element from one side to the other, then back. 


Proof of Theorem 3.2.1. Suppose A C B.Toshow AN B= A,show that AN BC 
AandACANB. 


(AN BC A): Pick x € AN B. Then x € A and x € B. Since x € A, we have that 
ANBCA. 

(A C AN B): Pick x € A. Then, since A C B, we have that x € B also. Therefore, 
since x € A and x € B, it follows that x ¢ AN B. Thus A C ANB. 


Since we have shown that AN B C A and A C AN B, we have demonstrated that 
ANB=A. 


This proof of Theorem 3.2.1 is intentionally wordier than it needs to be, so that 
you can see how its structure derives from the definitions of the terms it uses. A 
better, more succinct version would read as follows. 


Proof of Theorem 3.2.1 (Succinct). Suppose A C B. 


(Cc) Pick x € AN B. Then x € A,so that AN BC A. 


(>) Pick x € A. Then since A C B, it is also true that x € B, so that xe ANB. 
Thus A CAN B. 


Since AN BC Aand A CAN B,wehave that ANB=A. 


! See Definition 3.1.7. 
2 See Definitions 3.1.3 and 3.1.9. 
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Now we present a few more examples of set theorems and proofs, in order to 
set you up for the exercises to follow. 


Theorem 3.2.4 (DeMorgan’s Law). If A and B are sets, then 
(AN B)© = ACU BO (3.8) 
Proof. 


(<):Pick x € (AN B)©.Thenx ¢ AN B,sothateitherx ¢ Aorx ¢ B. We consider 
each case. 

(Case x ¢ A): If x ¢ A, then x € AC. Thus x € A©U BS. 

(Case x ¢ B): If x ¢ B, then x € B©. Thus x € A© U BS. 


In either case, we have that x ¢ A© U B®, so that (AM B)© Cc ASU BE. 

(>): Pick x € A© U B®. Then either x € A© or x € B©. We consider each case. 
(Case x € A©): If x € A, then x ¢ A. But if x ¢ A, then certainly x ¢ AN B. 
Thus x € (AN B)S. 

(Case x € BC): If x € BS, then x ¢ B. But if x ¢ B, then certainly x ¢ ANB. 

Thus x € (AN B)°. 


In either case, we have that x € (AN B)©,so that AC U B© C (ANB). 


Since (AN B)© C AC U BC and (AN B)© D A© U BC, we have that (AN B)© = 
ACU BE, 


The fact that the empty set contains no elements can make for some interesting 
twists in proofs. 


Theorem 3.2.5 For any set A,AU’= A. 
Proof. 


(C): Pick x € AU. Then either x € A,orx € Y. But since % contains no elements, 
it must be that x € A. Thus AU®C A. 


(>): Clearly, if x € A, then x € AUG. Thus A C AUG. 


Therefore, A U@ = A. 


The next theorem is natural to prove by contrapositive. 


Theorem 3.2.6 If AC B,thenA—B=49. 
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Proof. Suppose A — B is non-empty. Then there exists x € (A — B). Thus x € AN 
B®, so that x € A and x € B©. But the existence of such an x is precisely the 
negation of Definition 3.1.7,so that A ¢ B. 


Proof by contradiction can come in handy, too. 
Theorem 3.2.7 If Ais any set,@ C A. 
Proof. Suppose there exists a set A such that # ¢ A. Then there exists x € 4 such 


that x ¢ A. But % contains no elements. This contradicts the definition of 4. Thus 
OCA. 


EXERCISE 3.2.8 State the converse of Theorem 3.2.1 and prove it. 


EXERCISE 3.2.9 Prove the following. 


(a) For all sets A and B,if A C B,then AUB=B. 

(b) IfAC BandBCC,thenACC. 

(c) ANAT =@. 

(d) If A C B,then A© D> BS. 

(e) (DeMorgan’s Law) For all sets A and B, (AU B)© = ACN BS. 
(f) If A and B are disjoint, then AAB = AU B. 

(g) IfACCandBCC,thenAUBCC. 

(h) IfC C AandC C B,thenC CANB. 

(i) IfAC BandCCD,thenANCC BND. 

(j) If AC BandC CD,thenAUCC BUD. 


EXERCISE 3.2.10 State and prove the converse of Exercise 3.2.9(f).° 
EXERCISE 3.2.11 Prove that distributes over U and vice versa. 


EXERCISE 3.2.12 Sometimes we can prove certain sets are equal without hav- 
ing to chase elements back and forth, by appealing to earlier theorems we have 
proved. By making appropriate references to certain results from Exercise 3.2.9, 
prove the following are true for all sets M, N, S, and T. 

(a) IfM=NandS=T,thenMNS=NNT. 

(b) If M@=NandS=T,thenMUS=WNUT. 

(c) If M=N, then M© = N°. 


3 Try proof by contrapositive or contradiction. 
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Notice that Exercise 3.2.12 says that set intersection, union, and complement are 
well-defined operations on sets. 


EXERCISE 3.2.13 Show that union and intersection do not allow cancellation 
by providing counterexamples to the following statements. 


(a) If AUC= BUC, then A= B. 
(b) If ANC = BNC, then A = B. 


EXERCISE 3.2.14 If X and Y are disjoint sets, we sometimes write X U Y as 
X UY. This is a way of talking about the set X U Y by tagging it with a little symbol 
(the dot) that tells the reader the additional information that X and Y are disjoint. 
So if you see a statement like 


AUB=AU(B- A) (3.9) 
it is equivalent to the compound statement 
AUB=AU(B—A) and AN(B-—A)=9% (3.10) 


Prove Eq. (3.9) by showing both parts of (3.10). 


Families of Sets 


If you’re working with only a few sets at a time as we did in Section 3.2, it’s probably 
sufficient to use A, B, C,and so on, to represent them. If you have a set of, say, 10 
sets (generally called a family or collection of sets instead of a set of sets), it might 
be more sensible to put them into a family and address them as Aj, A2,..., Ato. 
In a case like this, we would say that the set {1,2,3,..., 10} indexes the family of 
sets. If we write 


Ne S119 sal 


then we could write a family of n sets as 
{Ay, Az, A3,.--, An} = {Ag 2k © Na} = {Arhear 


and we would say that N, is an index set for the family of sets. This notation has 
advantages, for then we could write unions and intersections more succinctly: 


n 
AU Ay U A3U---U An = (J Ag (3.11) 
k=1 


n 
A, A2N A3N- +» An =) Ax (3.12) 
k=1 
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In Definitions 3.3.3 and 3.3.4, we will define precisely what we mean by this sort 
of union and intersection. 

We can go even further. It is conceivable we might need to work with infinitely 
many sets {A;, Az, A3,...} that we might want to index with the positive integers. 
For example, if we use the familiar interval notation 


[a,b] = {x:x Ee Randa<x<b} 


we might talk about the family of intervals F = {An}nen, where An = [0, 1/n]. To 
form the union or intersection of a family of sets indexed by the positive integers, 
we could use notation like that in Eqs. (3.11) and (3.12). 


[o.@) [o,@) 
[J4n and [()An (3.13) 


or we could write something like 


[J An and {)} An (3.14) 


neN neN 


where by (3.14) we understand that n is allowed to take on all values of the indexing 
set of natural numbers. 

The notation in (3.14) is handy when the indexing set is more complicated than 
the positive integers and does not allow us to think of some index variable n starting 
at 1 and progressing sequentially off to infinity. For it is conceivable that any set A 
can index a family of sets. We can then address individual sets in the family as Ag, 
where a € A, and denote the family F = {Ag}yc.4. Union and intersection could 


then be written as 
U Aw and () Ag 
aceA acA 


Example 3.3.1 One important contribution of the German mathematician 
Richard Dedekind (1831-1916) is a rigorous foundation of the set of real numbers. 
He employed what is now called a Dedekind cut, whereby the set of rational num- 
bers is “cut” into two pieces. Specifically, for a real number r, Dedekind worked 
with sets of the form 


Ay = {x:x € Qandx <r} (3.15) 
B, = {x:x € Qandx >r} (3.16) 
For example, although V2 is irrational (as you will see in Section 3.10), it certainly 


makes sense to talk about A /j, the set of all rational numbers less than /2. The 
real numbers form the index set for the families {A;},cp and {B;},cr. 
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Example 3.3.2 Consider the family of intervals 
F ={[-nr]:ré€R* (3.17) 


where R™ denotes the positive real numbers. The index set is R*, and we might 
want to use the notation /, = [—r,r] to represent one interval in the family. We 
might also write F = {/;},cr+. I 


Of course, we don’t have to worry about having our family of sets so well 
organized as to be indexed by any set at all. In the same way that we talk about a 
set A and address an arbitrary element x € A, we can call our family of sets F and 
address an arbitrary set in the family as A € F. Then the union and intersection 
of the sets in F could be written as 


[JA and [()A (3.18) 


AcF ACF 


or more simply as 


[JA and [()A (3.19) 
F F 


The statements in (3.18) and (3.19) assume the least structure on the family of sets, 
so they are a good general notation. 

Before we dig into families of sets, we’re going to construct an extended 
metaphor that will help carry us through the complexities of the ideas and nota- 
tion. Families of sets tend to confuse students at first, and for several reasons. First, 
when we talk about a family, we are actually talking about a set whose elements 
are sets. So it might be that x € A and A € Ff, but writing x € F is wrong, for x is 
not an element of F, but an element of an element of F. Second, when a family 
is indexed, the indexing set A is yet another set to contend with, whose elements 
a serve as something like name tags by which sets in the family are addressed. So 
let’s stage a little production whose cast of characters all represent sets and terms 
we have mentioned in this section. Then as we state theorems, the metaphor will 
help us understand what is being said and how to approach the proof. 

Suppose we are studying the mathematics section of the catalog of Prestigious 
University. Every mathematics class has a number, so let 


A = {104, 210, 211, 212, 330, 360, 430, 431, 510, 531} (3.20) 


and think of A as the set of all the course numbers of mathematics courses offered 
at PU. This set of course numbers will be our index set. We address an arbitrar- 
ily chosen course number as a € A. Let Ag be the roster of all PU graduates 
who passed course number a while they were students at PU. We will address 
an arbitrarily chosen student on roster Ay as x. With these role assignments, 
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F = {Aa}acA is the set of all these rosters of students who passed the individual 
courses. That is, 


F = {Ajoa, Azio, Azi,.--, A531} (3.21) 


Writing the family of rosters as F = {Ay}ye.4 references each roster in the family 
in terms of the course number that tags it. Writing F simply as {A} dispenses with 
the number tags and addresses a particular roster in F simply as A. 

Certainly, the family of sets F and its indexing set A need not look anything 
like the ones we have created here. But thinking of A as in Eq. (3.20) and ¥ as in 
Eq. (3.21) should not limit us or make our proofs less than generally applicable 
if we use the metaphor as a tool to help clarify our thinking. Just remember that 
we will address sets in the family in one of two ways. To illustrate the first way, 
sometimes we will pick some arbitrary a € A inorder to talk about Ag, whichis like 
choosing an arbitrary course number and talking about the roster for the course 
with that number. Or perhaps we will claim the existence of some specific ag € A 
in order to talk about A,,. In our metaphor, this is the claim that there is a certain 
course number whose roster has some property. To illustrate the second way of 
addressing sets in the family, sometimes we might pick an arbitrary A € ¥, which 
is like choosing an arbitrary mathematics roster without any specific reference to 
a course number. Or perhaps we will claim the existence of some specific Ag € F. 
This is the claim that there is a particular mathematics course roster that has a 
certain property, without making any reference to the course number. 

Having created this little metaphor as an aid to our understanding, let’s define 
more terms and derive some results, using the metaphor to motivate and get us 
over some humps. First, how should we define the sets UgcerA and N4crA? Or 
equivalently, how should we define the following statements?4 


xe|JA and xef)A (3.22) 
F F 


We want U,4<FA to be the set you get when you take each A in F and dump all 
its elements into a single set. For our metaphor, Uye4Aa, Or Uae FA, however 
you choose to write it, is the single roster of graduates created by unioning all 
the individual mathematics class rosters. Maybe you can see that UzA consists of 
all graduates who have ever passed a mathematics class at PU. So if x represents 
a graduate, saying x € UtA means that there is some mathematics course at PU 
that x passed. We can write this in two ways, with or without a reference to the 
course numbers: 


x€ [J Au @ (Qa € A)(x € Ag) (3.23) 
aceA 


4 Think in terms of J for the union and Y for the intersection. 
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xe |) A GAe Fyre A) (3.24) 
AcF 


The form of (3.23) says there exists a course number a € A such that graduate x 
is on the roster of mathematics course number a. The form of (3.24) says simply 
that there is a mathematics course roster on which the name of graduate x appears, 
without any reference to a course number. With this, we arrive at the following 
definition. 


Definition 3.3.3 Let F be a family of sets indexed by A. Then the union over 
F is defined by 


L) Aa = {x : Ga € A)(x € Aa)} (3.25) 
acA 


Without reference to the index set, this becomes 


|) A={x: GA € F)(we A)} (3.26) 
AcF 


Now what does Nye. 4Aq (or N4eFA) correspond to in our metaphor? This is 
the intersection of all rosters, so it is the list of all graduates who passed all mathe- 
matics courses at PU. So what does it mean mathematically to say x € Nye 4Aq (or 
x ENA)? 


xé { ) Au @ (Wa € A)(x € Ag) (3.27) 
acA 

xe ()A@WA€F)(xE A) (3.28) 
AcF 


With this we arrive at the following definition. 


Definition 3.3.4 Let F be a family of sets indexed by A. Then the intersection 
over F is defined by 


() Aa = {x: (Va € A)(x € Ag)} (3.29) 
acA 


Without reference to the index set, this becomes 


() A=(x: WA€ F)(x€ A)} (3.30) 
ACF 


Many of the results we proved in Section 3.2 for a family of two or three sets 
carry over to analogous results for a family of sets of any size. So as you work 
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your way through the rest of this section, note that some of the theorems are 
generalizations of results from Section 3.2. 


Example 3.3.5 Construct the negation of the statement x € UrA. 
Solution By negating the statement in (3.23), to say x ¢ Ur A means 

(Wa € A)(x ¢ Ag) 
Without making reference to the index set, we use (3.24) to have 

(VA € F)(x ¢ A) a 

In our mathematics class metaphor, what does x ¢ Uz A mean? Graduate x is 

not on the universal roster of mathematics classes, so x never passed a math- 
ematics class at PU. That is, for every course number a € A, x did not pass 
the mathematics course numbered a, or, for every roster A € F, x is not on 


roster A. 


EXERCISE 3.3.6 Construct the negation of the statement x € NA intwo forms. 
What does x ¢ N¢A mean in the context of the mathematics class metaphor? 


Theorem 3.3.7 (DeMorgan’s Law). Suppose F is a family of sets. Then 


[U4] =()a° (3.31) 


F 


Proof. 
(<): Pick x € [Ux A]©. Then x ¢ UFA. Therefore, for all A € F, x ¢ A. But then 
x € A© for every A € F. Thus x ENA. 


(>): Pick x € N¢A©. Then x € A© for every A € F. Thus, x ¢ A for every A € Ff, 
so that x ¢ U¢A. Therefore x € [Ur A]. 


To see what Theorem 3.3.7 says in our metaphor, we need a universal set in 
which the sets in the family are meaningful. Let U be the roster of all PU grad- 
uates. With this, what set is being talked about in Theorem 3.3.7, and what are 
the two ways of constructing it in Eq. (3.31)? To construct the left-hand side 
of Eq. (3.31), we first combine all the mathematics rosters into a single roster 
and then take all the PU graduates except these. This is the list of all PU grad- 
uates who avoided mathematics altogether while they were at PU. To construct 
the right-hand side of Eq. (3.31), first we take each mathematics class roster and 
consider the complement. For example, AS is the list of all PU graduates who 
did not pass Mathematics 211. By taking the intersection of all these comple- 
ments, we arrive at the list of PU graduates who avoided Math 104 and Math 210 
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and ... and Math 531; that is, the list of graduates who avoided mathematics 
altogether. 


EXERCISE 3.3.8 [DeMorgan’s Law] Suppose F is a family of sets. Then 


IN a] =(Jae (3.32) 


F 


For the next theorem, we prove part 2 here and leave part 1 to you as an 
exercise. 


Theorem 3.3.9 Let F be a family of sets indexed by A, and suppose B C A. 
Then, 


1. UpgepAg © Use sAa 
2. NpeBAp > Nac AAa 


Before we prove part 2 of Theorem 3.3.9, let’s see how it relates to our 
metaphor. Since A is the set of all mathematics course numbers at PU, it might 
work to think of 6 as the set of all course numbers of lower level mathematics 
courses at PU. So 


B = {104, 210, 211, 212} (3.33) 


With that, what does part 2 of Theorem 3.3.9 say? The set construction on the left- 
hand side is the intersection of the class rosters across all the lower level courses, 
while the right-hand side is the intersection of the rosters of all the mathematics 
classes. So if we pick some x € Nge4 Aa, then x is a PU graduate who passed every 
mathematics course offered. So certainly x passed all the lower level courses. 
Here is the proof. 


Proof of part 2. Pick x € Nuc 4Aa. We must show that x € Ag for all 6 € B, so 
pick 6 € B. Since B C A, it follows that 6 ¢ A. Therefore x € Ag, because x € Ag 
for any a € A. Since 6 was chosen arbitrarily, we have shown that x € Ag for all 
B € B,so that x € NgeBAs. 


EXERCISE 3.3.10 Prove part 1 of Theorem 3.3.9. What is this statement saying 
in the context of the mathematics class metaphor? 


We can write Theorem 3.3.9 in a slightly different form if the family of sets is 
not indexed. If F; is a family of sets and Fy C Fj, we call Fy a subfamily of Fy. 
Since B C A in Theorem 3.3.9, {Ag}gep is a subfamily of {Ag}ac.4. Swapping the 
notation in Theorem 3.3.9 for an arbitrary family 7; and a subfamily 72, we have 
the following. 


Theorem 3.3.11 Suppose F; is a family of sets, and F> is a subfamily of Fj. 
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Then 
1. Up, A CUPF,A 
2: NpADNF,A 


The next theorem involves two families of sets, both indexed by A, where 
corresponding sets in the two families are related by subset inclusion. To under- 
stand what it says, think of By, as the set of all female PU graduates who passed 
mathematics course number a. 


EXERCISE 3.3.12 Suppose Fi = {Aw}ye4 and Fo = {Bu}we,4 are two families 
of sets with the property that By C Ag for every a € A. Then 


(a) Une Ba S Uses Ac 
(b) Maca Bu S Maes Aa 


EXERCISE 3.3.13 Suppose F = {A} is a family of sets, and suppose C is a set 
for which A C C for every A € F. Show that UrA C cS 


EXERCISE 3.3.14 Suppose F = {A} is a family of sets, and suppose D is a set 
for which D C A for every A € ¥. Show that D C nA? 


The Principle of Mathematical Induction 


In the world of mathematics, the well-ordering principle (WOP) is often taken 
as an axiom. In this section, we derive a theorem based on the WOP called the 
Principle of Mathematical Induction (PMI). To write a proof by induction, the 
imagery is that we have an infinite row of dominoes that we must knock down. 
First, we show figuratively that we can knock the first domino down. Then we 
show that if the nth domino falls, then so does the (n + 1)st. This very important 
proof technique is useful when the theorem you're trying to prove has a form like 
any of the following examples. 


Example 3.4.1 For any positive integer n, 


n 
1 
(iota Se 


3.34 
2 Ce) 
k=1 


Think of C as the set of all PU graduates who ever enrolled in a mathematics class. 
Think of D as the set of all PU mathematics majors (whom we will assume would have taken every 
mathematics course) who graduated with a 4.00 GPA. 
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Example 3.4.2 For any positive integer n, 


n(n + 1)(2n + 1) 


; (3.35) 


n 
742743 4-47 =) P= 
k=1 


Example 3.4.3 Let A be a given set, and {B,}/_, a family of n > 1 sets. Then 


AU (BLN Bo M+++ A Bn) = (AU By) N (AU By) A+++ (AU Bn) (3.36) 
= 


Example 3.4.4 Suppose there aren > 1 people in a room, and everyone shakes 
hands with everyone else exactly once. Then the number of handshakes that takes 
placeisn(n—1)/2. 


Example 3.4.5 Let G be a connected, acyclic graph on n vertices, where n > 1. 
Then G hasn—Jledges. & 


Notice that Examples 3.4.1-3.4.5 all make a statement about a finite but 
unspecified n number of things, and you want to prove that the claim is true for 
anyn > 1. 

You might have found Eq. (3.34) handy if you had been in grammar school 
with Carl Friedrich Gauss in the 1780s. Gauss, a very precocious child, showed 
amazing mathematical ability at a very early age. A somewhat embellished story 
goes that when Gauss was eight years old, it was raining one day during recess, 
Internet access was down, and his teacher needed to keep the children in the class 
busy for a while. So he told them to add up the first 100 natural numbers without 
their calculators. Gauss figured out how to get the result quickly in the following 
way. By writing the sum twice, once in reverse order, he added vertically, term 
by term: 


1 De. Bee Ae 9 100 
100-004 OS. CP Aone. De. | 
101 + 101 + 101+ 101+---+ 101+ 101 
a 


100 terms 


Gauss observed that 100 x 101 is twice the desired result, so he quickly reported 
the result of 5050. If you perform a similar trick replacing 100 with an arbitrary n, 
you get Eq. (3.34). 

Though Gauss’s technique might seem sufficient as a proof, there is something 
a little sloppy about making a claim that involves a “dot dot dot” in it. The PMI is 
a theorem derived from the WOP that eliminates this untidiness. 
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So what is the PMI? And what does the WOP have to do with it? Let’s conduct 
a thought experiment to set it up, first in its standard form. Then in Section 3.5 we 
will look at some variations. 

Suppose we consider a set S, which is assumed to be a subset of the positive 
integers, and suppose S is known to have the following properties: 


(11) les 
(12) Ifn >landneS,thenn+1eS. 


Then what precisely is S? Now I1 says 1 € S, but then 12 applies to guarantee 
that, since 1 € S, then 1+ 1=2 eS. But then 12 applies again to guarantee that 
2+1=3€ S,andsoon. NowS C Nis assumed, and it appears that every positive 
integer is also in S,so that N C S. So surely S = N. Fair enough. But this argument 
has its own “dot dot dot” and is a bit less than rigorous. 

Another way to look at this same argument is still a little open ended, but 
comes closer to the actual proof as we will present it. If it were true that § A N, 
then since § C N,it must be that N ¢ S. Thus there exists k ¢ Nsuchthatk ¢ S. By 
11,k £1,so thatk — 1isa positive integer. By the contrapositive of 12, sincek ¢ S, 
then k — 1 ¢ S either. Therefore k — 1 4 1,so thatk —2 € N,andsincek—1¢ 5S, 
12 implies k — 2 ¢ S. Here’s where the “dot dot dot” comes in, and we have that 
k,k —1,k—2,...¢ 8. But this crashes head-on into the fact that 1 € S. 

If we look to the WOP, then we can clean up these arguments and find in 
the WOP enough strength to provide a rigorous proof that any subset of the 
positive integers that has properties I 1-I 2 must, in fact, be the entire set of positive 
integers. This is the Principle of Mathematical Induction. 


Theorem 3.4.6 (PMI). Suppose S is a subset of the positive integers with the 
following properties: 


(I1) les 
(12) Ifn>landne S,thenn+1eS. 


Then S=N. 

The proof of the PMI is a great example of proof by contradiction. If you would 
like to try to prove it yourself, then take a glance at the hints’:8-?-!0-!! below if 
you need to. Here is the proof. 


Proof. Suppose S is a subset of the positive integers with properties 1-12. Sup- 
pose also that S 4 N. Then N ¢ S. Thus there exists n € N such that n ¢ S. If we 


7 The theorem says [((S CN) AIL A12] > (S = N). Suppose this is false. 
State N ¢ Sin 3 form. 
9 Let T = N-— S. What does the WOP say about T? 
10 Tf T has a smallest element a, what can you say about a — 1? 
ll Butifa—le S, then what is true of a? 


lo.) 
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define T= N—S =NNS°,then T C Nandn € T. Thus T is a non-empty subset 
of the positive integers, which, by the WOP, contains a smallest element a. Now it 
is impossible that a = 1 because 1 € S.Thusa > 1,so thata — 1 € N. Furthermore, 
since a is the smallest element of T,it must be thata —1 ¢ T. Thusa—1 € S. But 
by 12, since a—1 € S, it follows thata—1+1=aeS. Butifae S, thena ¢ T. 
This is a contradiction. Thus there is no smallest element of 7, which means that 
T is empty. Therefore, N C S, whence S = N. 


What does Theorem 3.4.6 have to do with proving the claims in Examples 3.4.1- 
3.4.5? Think of these examples as statements about the positive integers. They 
say effectively that a certain formula or statement is true for all n > 1. To be as 
general as possible, address such a statement by P(n). Let’s define S to be the 
set of all positive integers n for which the statement P(n) is true. The trick is to 
show that S has properties 11-12. Then the PMI will allow us to conclude that 
S = N, which is the same as saying P(n) is true for all n. First we show that 1 € S by 
showing P(1) is true. This is called establishing the base case. Then we show that 
néS=>n-+1€S bysupposing P(n) is true, and using this to show that P(n + 1) 
is true. The assumption that n € S is called the inductive assumption, and the part 
of the proof where we show that n € S implies n+ 1 € S is called the inductive 
step. Having shown that S has properties I 1-12, we then conclude S = N; that is, 
P(n) holds true for all n. Here is a sample theorem. 


Theorem 3.4.7 (Sample). For all positive integers n, P(n). 
Proof. We prove by induction on n > 1. 


(11) P(). 
(12) Suppose n > 1 and P(n). Then... .Thus P(n + 1). 


Therefore, by induction, P(n) is true for all n > 1. 


There will be some point in step I2 where you use the inductive assumption 
P(n) to get over the hump of showing P(n + 1). In the proof of the next theorem, 
we rewrite a sum in the following way: 


n+1 n 
oe ak = (4 a2 +--+ + an) + On41 = (> a) + dn+1 (3.37) 
k=1 k=1 


Theorem 3.4.8 For alln > 1, 


a r= may Dens’ (3.38) 
k=1 


Proof. We prove by induction on n > 1. 
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(11) For the case n = 1, we have 


n 1 
Yor =P HP H=15=104+12-14+)/6=nn+12n+1)/6 
k=1 k=1 


so that Eq. (3.38) holds true for n = 1. 
(12) Suppose n > 1 and )vt_, k? =n(n + 1)(2n +1)/6. Then 


n+1 n 
r= +n (ie BOE NCEEY n+1)2 
2 X +(n+1) z +(n+1) 
2n> + 3n*+n+6(n2?+2n4+1)  2n3+4+9n? +13n4+6 
= 6 = 6 
— @tln+2)2n4+3) @tDi@41)+1Rat+) +1 
6 6 


Thus Eq. (3.38) holds for n + 1. 
By the PMI, it follows that Eq. (3.38) holds for all n > 1. 
EXERCISE 3.4.9 Prove the following sum formulas for n > 1. 
(a) Dhak = 
(b) Cha Di = (preg 
(©) Dh Qk-1) =n? 

2 

(4) Deer = (Li 4) 


n 1 
(e) dopa1 Kk+1) — ai 


Before we prove the claim in Example 3.4.3, we need to make an observation. 


In Exercise 3.2.11, you showed that 


AU(BNC) =(AUB)N(AUC) (3.39) 


This is the special case of Eq. (3.36) where n = 2. We need this fact in proving the 


inductive step below. 


Theorem 3.4.10 Suppose A is a given set and {Bx}f_, is a family of n > 1 sets. 


Then 


AU (BLN BoM +++ By) = (AU By) N(AU By) N+ ++ (AU By) (3.40) 
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Proof. To clean up the notation, we write Eq. (3.40) as 
AU [M1 Be] = M4 (A U By) (3.41) 


(I1) Ifn =1,thenM}_, By = Bi,so that both sides of Eq. (3.41) are simply A U By. 
(12) Suppose n > 1 and Eq. (3.41) holds for n. Then 


AU[MES Be] = AU [(MLy Be) O By4t] 


by 3.39) [A U (Mg Be)] N (A U Bn41) 


= [NPL, (AU BY] N (AU Bayt) = NET (A U Be) 


Thus, Eq. (3.41) holds for n + 1. 


By the PMI, Eq. (3.41) holds for all n. 


EXERCISE 3.4.11 Suppose A is a given set and {B;}7?_, is a family of n > 1 sets. 
Then 


AN (By U By U+++U By) = (AN By) U (AN By) U- > U(AN By) (3.42) 


EXERCISE 3.4.12 Suppose Aj, Az, A3, ... are sets with the property that A, C 
An+1 for every n > 1. Use induction to show that A, © A, for all n. 


Now we prove the claim in Example 3.4.4. 


Theorem 3.4.13 Suppose there are n people in a room, and everyone shakes 
hands with everyone else exactly once. Then the number of handshakes that takes 
place is n(n — 1)/2. 


Proof. 


(11) If there is one person in the room, then there are zero handshakes. But 
1d — 1)/2 = 0,so0 the result is true for the case n = 1. 


(12) Suppose n > 1 and that in any room of n people there will be n(n — 1)/2 
handshakes. Now suppose there are n+ 1 people in the room. Remove 
one person from the room, so that n people remain. If all these people 
shake hands with each other, then by the inductive assumption, there will 
be n(n — 1)/2 handshakes. Now bring the removed person back into the 
room. When this person shakes hands with everyone, there will be n more 
handshakes. Thus the total number of handshakes among all n + 1 people is 


DUES SD) 3s n2—n 2n  n?+n_ (n+1)n 
2 2 2. a 2: 


84 


Chapter 3 Sets and Their Properties 


Thus the result is true for the group of n + 1 people. 


By the PMI, it follows that the result is true for all n > 1. 


EXERCISE 3.4.14 The Tower of Hanoi game consists of three pegs, one of which 
holds a stack of n disks, and where the disks decrease in size from the bottom to 
the top of the stack. See Figure 3.9. The object of the game is to move the entire 
stack of disks from its current peg to either of the other two pegs. The rules of the 
game are that the disks may be moved from peg to peg only one at a time, and at 
no point may a disk sit on top of a smaller disk. Show that the object of the game 
may be achieved on a stack of n > 1 disks in 2” — 1 moves. 


EXERCISE 3.4.15 Suppose n is a positive integer and a, bj, b2,..., by are real 
numbers. Then 


a( dar bk) = pas (abe) (3.43) 

Theorem 3.4.16 Suppose m and n are positive integers and let aj, a2,..., am 
and bj, b2,..., b, be real numbers. Then 

(se 71 a4) (Sekar be) = DE (har jhe) (3.44) 


If m = n = 2, this is the FOIL technique of multiplying two binomials. 


Proof. First we apply Exercise 3.4.15 by distributing }°7_, b, over the terms in 
the sum )°"_, a; to have 


(ered aj) (Sea br) = veilaj (es bx)] 


Then we apply Exercise 3.4.15 again by distributing each a; over the terms in the 
sum )~;_, by to have 


Figure 3.9 The Tower of Hanoi. 
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EXERCISE 3.4.17. For real numbers a1, a9,..., ee Pree | ay.| < ye lael. 


EXERCISE 3.4.18 Foralln > 1,3| (4" —1). 


EXERCISE 3.4.19 Suppose a1, a2, ..., @, are nonzero integers and p is a prime 
number. Suppose also that p | ajaz---a,. Then there exists some k (1 <k <n) 
such that p | ax. 


Variations of the PMI 


There are several useful variations of the standard PMI from Section 3.4. First, in 
defining S on p. 80, there was nothing magical about making 1 the first element 
of S. We could have supposed that 7 is any integer and that S is a subset of the 
integers with the following properties. 


(J1) jeS. 
(J2) Ifm> jandne S,thenn+1eS. 


By mimicking almost word for word the proof of Theorem 3.4.6, we have the 
following. 


Theorem 3.5.1 (PMI, Version 2). Suppose S is a subset of the integers that has 
properties J1-J2. Then S = {j,j+1,j+2,...J={(neZ:n> j}. 


If j = 0, we have a PMI that allows us to prove results are true for all nonneg- 
ative integers. The proofs are identical in principle to those in Section 3.4, but the 
induction arguments are rooted at n = 0 instead of n = 1. 


EXERCISE 3.5.2 Prove the sum formula )7"_, 2* = 2"+! —1. 
Let x be a nonzero real number, and define the following. 


eet 
(3.45) 
xl". forn>0 


This is the standard way of defining exponentiation. Instead of saying something 
like x? = x---x nine times, we say x? = x8. x, and we define nonnegative inte- 
ger powers of x recursively. This definition of exponentiation has some familiar 
behaviors. Notice that the next theorem deals only with nonnegative exponents. 
We will prove one result and leave the others to you as an exercise. 


Theorem 3.5.3 Let a and b be nonzero real numbers, and let m and n be 
nonnegative integers. Then 


a” a" =a" (3.46) 
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(a®)" = a™ (3.47) 
(ab)"” = ab" (3.48) 


Proof of Eq. (3.46). We prove by induction on n > 0 (thinking of m as fixed). 


(J1) For the case n = 0, we have a” -a° = a” -1 =a" = a"*°. Thus the result is 
true forn = 0. 


(J2) Suppose n > 0 and a” - a” = a”*". We show that a” - a"! = a@™ tr), 


Gag ao ia aye i aexe 


=q"*".q= qQ@tn+l = grtath 


Thus by induction, a” - a" = a’"*" for all nonzero a and nonnegative integers m 
: S 
and n. 


EXERCISE 3.5.4 Prove Eqs. (3.47) and (3.48). 


Now we can define x” forn < 0. For nonzero real number x and positive integer 
n, define 


yee (3.49) 


Notice in the next exercise how we must derive the rules for negative exponents 
very carefully from the same rules for nonnegative exponents. No induction 
arguments are necessary in this exercise, but merely careful attention to the 
manipulation of exponents as allowed by Theorem 3.5.3. 


EXERCISE 3.5.5 This exercise will show that the rules for exponents in 
Eqs. (3.46)—(3.48) hold for all integer exponents, positive, zero, or negative. In 
all parts of this exercise, a and b are nonzero real numbers, and m and n are 
nonnegative integers. Prove the following. 

(a) a’-a"=1 

(b) (a”)~1 _ (a~1)" 12 

(c) qi” ; a” = qin 

(d) a™a7" = qn 13 

(e) (a—™)" — q mm 


(f) (a")-" =ao™ 


12 This should be a mere observation from part (a). 
13 Take a hint from a’. a~® = a2.a5-a~® = a? = a8 and a? -a78 = a? -a~*. -a~® = a = a8. 
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(g) (a~™)” _ qQ 
(h) (ab) " =a "hb 


EXERCISE 3.5.6 Suppose a is a nonzero real number and n is a nonnegative 
integer. Prove the following.'* 


(a) (—a)2" = Qn 
(b) (—a)2#+1 = —g2ntl 
EXERCISE 3.5.7 Prove the following for n > 1. 


(a) If0 <a <b,thena’ <p"! 
(b) Ifx > 1,thenx” > 1 
(c) Ifa <b<0,then 


(i) qe > pn 16 


(ii) qtl < pentl 
(d) Ifa <0 <b,thena"+! < prtl! 
Another example of a recursive definition is the factorial. 


O!=1 
(3.50) 
(n+1)!=(n+1)-n! forn>0 


EXERCISE 3.5.8 Prove the following for n > 0. 


(a) Ye=o (aan =1 my 

(b) reo k kL =(2+D!-1 

EXERCISE 3.5.9 Define a sequence recursively by letting aj = 0, and defining 
n 


for all n > 0. 


An41 = 44, +1 forn > 0. Then ay = rel 


EXERCISE 3.5.10 By Theorem 3.4.16, polynomial multiplication will reveal 
that 


A—x1+xt+xr4¢r4¢x4=1-9 


14 You can prove these without induction if you exploit previous exercises appropriately. 
15 Th the inductive step, separate the cases a = 0 and a > 0. Look to Exercise 2.2.5(j). 

16 Use the fact that 0 < —b < —aand Exercise 3.5.6. 

17 Use Exercise 2.2.6(c). 
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If x 4 1, we may divide both sides through by 1 — x to have 


1-3 


ak, 


ltxtx?t+rePtxta 


This algebraic observation suggests a general formula for the sum of powers of a 
real number x 4 1. Ifn > 0, then 


_ ntl 


ize Gop 


n 
lteter tte ta" = oak a 
k=0 


Prove Eq. (3.51) with an induction argument on n > 0. 


EXERCISE 3.5.11 Prove the following factorization formula for nonzero real 
numbers a and b and nonnegative integer n.!® 


qitl — prt = (a — b)(a" ta" p+ qr + eee te @2b"-2 4 apt) +b”) 
(3.52) 


Another reason Theorem 3.5.1 is useful is that a result might not be true for all 
positive integers, but only eventually true, that is, true for n > j for some positive 
integer j. Just like we use )77_, ax to denote the sum of the az, we can use the 
notation I17_,a; to denote the product of the terms. In other words, 


Tak = a1 + 2+ a3+++ dy 
EXERCISE 3.5.12 Foralln > 2,1m"_,(1— 4) =}. 
EXERCISE 3.5.13 Ifn > 4 isan integer, then 2” < nl. 
EXERCISE 3.5.14 Ifn > 2 isan integer, then 3 | (n3 —7n). 


The next lemma does not require induction, but is helpful in the exercise that 
follows. 
Lemma 3.5.15 Ifn > 3 is an integer, then n? > 2n +1. 


Proof. Letn > 3 be an integer. Then 


n=n-n>3n=2n+n>2n+1 


5. n > 5is an integer, then n* < 2”. 
EXERCISE 3.5.16 Ifn > 5 isan integer, then n2 < 2” 


18 Don’t make another induction argument. Letting x = b/a in Eq. (3.51) provides a good start. 
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Strong Induction 
There is another way to build the set S on p. 80. Consider the following. Suppose 
S is a subset of the positive integers that has the following properties. 
(K1) leS 
(K2) Ifn >2and1,2,...,..—leéS,thenne S. 


Then by yet another mimicking of the proof of Theorem 3.4.6, we could show that 
S =N. This is called the strong principle of mathematical induction, or SPMI. 


Theorem 3.5.17 (SPMI). Suppose S is a subset of the positive integers that has 
properties K1-K2. Then § = N. 


EXERCISE 3.5.18 This exercise addresses why Theorem 3.5.17 is called strong 
induction. To do so, we alter property I2 slightly to read: Ifn > 2 andn—1e/S, 
then n ¢€ S. For each of the following pairs of statements, state which is stronger. 


(a) n>2andn—-1les n>2and1,2,...,.n-1leS 
(b) Property 12 Property K2 

(c) Properties 11-12 Properties K1-K2 

(d) IfI1and12,then S=N. If K1 and K2, then S=N. 


This is why the SPMI can be more powerful than the PMI. If we are required 
to prove a result is true for all n > 1 and induction seems to be the way to go, 
we might find that regular induction does not provide us with a strong enough 
assumption to make the inductive leap. With either form of induction, we would 
still need to show that 1 € S. But to make the inductive step, regular induction 
would only allow us to assume n € S and require us to conclude n + 1 € S solely 
from this. On the other hand, strong induction allows us to assume that all of 
1,2,...,n —1 € S,and requires us to conclude n € S from this more extensive set 
of assumptions.!? 

In Section 2.5 we defined a positive integer to be prime provided it has exactly 
two distinct positive integer factors. An integer n > 2 that is not prime is called 
composite, and such a number can be written in the form n = ab, where a and 
b are positive integers strictly less than n. The following theorem addresses the 
factorization of positive integers into a product of primes. We use a slight variation 
of the SPMI by rooting it at n = 2. 


Theorem 3.5.19 (Fundamental Theorem of Arithmetic). Every integer n> 
2 can be written as the product of primes, and this factorization is unique, except 
perhaps for the order in which the factors are written. 


19 Actually, strong induction is no stronger than regular induction. They are logically equivalent. 
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Proof. We use strong induction on n > 2 to show existence. Since 2 is prime, the 
result is true for n = 2. So let n > 3 be given, and suppose all of 2,3,...,n—1 
can be written as the product of primes. Now if n is itself prime, then its prime 
factorization is trivial, and the result is true. On the other hand, if 1 is composite, 
then there exist positive integers a and b such that 1 <a <n,1<b<n,andn= 
ab. By the inductive assumption, since 2 < a, b < n — 1,bothaandbcan be written 
as the product of primes. That is 


a= Pip2-:: Ps 
b=192°°'% 


where all pz and gq, are prime. Thus n = p,--- psqi-+-q;, and we have found a 
prime factorization for n. 

To show uniqueness, choose n > 2, and suppose n = pip2--: py and n= 
qiq2:--q are two ways of writing n as a product of primes. We show by induction 
on k that / = k and, with some possible reordering of the g;, that p; = q; for all 
1<i<k. 

If k = 1, then n = py, so that n is prime. Thus / = 1 and p; = q1. So suppose 
k > 2 and suppose that any factorization of a positive integer into k — 1 primes is 
unique up to order of the factors. Since 


P1p2*** Pk =192°** 41 (3.53) 


then px | giqg2--- qi. By Exercise 3.4.19, there exists some j, where 1 < j </,such 
that px | gj. Since g; is prime, py = g;. Reordering the right-hand side of Eq. (3.53) 
by switching g; and q;, then canceling px and gq), we have factorizations of the 
positive integer n/p, intok — 1 and/ — 1 primes. Since px > 2, we have that n/p, < 
n. Applying the inductive assumption to n/ px, we conclude that k — 1 =/—1,and 
with some possible reordering of q1,...,q-1, we have that pj = qj for 1 <i < 
1—1. Thus / = k, the factorization of n into k primes is unique up to the order of 
the factors. 


Strong induction on n can be adapted slightly in the following way. Estab- 
lish a range of base cases 0 <n < k —1, then use a strong inductive assumption 
forn >k. 


EXERCISE 3.5.20 Every integer n > 30 can be written as an integer linear com- 
bination of 6 and 7 using nonnegative multiples. That is, if n > 30 is an integer, 
then there exist nonnegative integers j and k such that n = 6j + 7k.” 


EXERCISE 3.5.21 Let aand b be integers, where a > 0 and b > 0. Prove the 
existence part of the division algorithm (Theorem 2.4.7) using an adapted strong 
induction on b with base cases 0 < b< a—1. 


20 Establish base cases from n = 30 ton = 35. 
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EXERCISE 3.5.22 Suppose n > 1 disks that are black on one side and white 
on the other are arranged in a straight line with a random arrangement of black 
sides up. A game is to be played whereby a black disk is removed, and its imme- 
diate neighbors, if any, are flipped over. (Two disks are not said to be immediate 
neighbors if there have been any empty spaces created between them.) A game is 
winnable if it is possible to remove all n disks. What constitutes a winnable game? 
Prove your claim using strong induction.”! 


EXERCISE 3.5.23 Repeat Exercise 3.5.22 for a similar game where n > 3 disks 
are arranged in a circle. 


Equivalence Relations 
The notion of equality is so central to the mathematical enterprise that we do not 


even think about the ways we manipulate equations. In Chapter 0 we stated three 
assumptions about equality in the real numbers. 


(A1) Properties of Equality: 


(a) Forallae Ra=a. (Reflexive) 
(b) For alla,b € R,ifa=b, then b =a. (Symmetric) 
(c) Foralla,b,c € R,ifa=bandb=c,thena=c. (Transitive) 


In this section, we will consider a variety of sets and define on them a notion 
of equivalence that has the reflexive, symmetric, and transitive properties. As you 
work through these examples and exercises, similarities will begin to show through, 
and these will illustrate other abstract features of equivalences that we will explore 
in Section 3.7. 

Before we get to the examples, we note that the symbol for equivalence can 
vary greatly from situation to situation. Given a set S and two elements x, y € S, 
there are different symbols commonly used to denote that x and y are equivalent, 
whatever that might mean in its particular context. For example, 


x=y 
x=y 
x =n y 
amy 


21 David Beckwith, Glenn G. Chappell, and O. P. Lossers, American Mathematical Monthly Vol. 104, 


9 (Nov. 1997): 876. 
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are but a few. The statement xR y is read “x is related to y,” and derives from 
a way of addressing equivalence in terms of a relation, which we will investi- 
gate in Section 3.11. Since our goal is to address the features of equivalence that 
transcend context, let’s choose one of these common symbols, say =, and stick 
with it. 


Definition 3.6.1 Let S be a non-empty set, and let “x = y” be a statement for 
all x, y € S. That is, for every x, y € S, either x = y or x ¥ y is true. Suppose also 
that = has the following properties on S. 


(E1) Forallxe S,x=x. (Reflexive) 
(E2) For all x, ye S,ifx = y, then y = x. (Symmetric) 
(E3) For all x, y,z € S,ifx = yand y =z, thenx =z. (Transitive) 


Then = is called an equivalence relation on S. 


Here is our first concrete example. On the surface, it might not sound partic- 
ularly mathematical. To the contrary, it is, shall we say, equivalent to all the other 
examples we will study, in that it has properties E1—-E3. 


Example 3.6.2 Let C be the set of all cities on earth. Suppose for the sake 
of argument that there are no one-way roads. Define two cities x, y € C to be 
equivalent, x = y, provided it is possible to drive on roads from city x to city y. 
Then = defines an equivalence relation on C. 


Solution We must verify that properties E1-E3 hold on C. 


(E1) Pick x € C. Since it is possible (trivially) to drive from city x to city x on 
roads, then x = x. 


(E2) Pick x, y € C and suppose x = y. Then it is possible to drive from x to y on 
roads. Since we have assumed no roads are one-way, then it is also possible 
to drive from y to x, since the same roads going from x to y can be traveled 
in the opposite direction. Thus y = x. 


(E3) Pick x, y,z € C and suppose x = y and y =z. Then it is possible to drive 
from x to y on roads, and it is possible to drive from y to z on roads. By 
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beginning at x, driving to y, then to z, we see that it is possible to drive 
from x to z on roads. Thus x = z. 


Since = satisfies properties E1-E3, we have that = is an equivalence relation 
onc. & 


EXERCISE 3.6.3 Name three cities that are equivalent (according to Exam- 
ple 3.6.2) to Beira. 


Example 3.6.2 and those to follow are all cut from the same pattern. They 
involve first defining a criterion by which two elements of a set are declared to be 
equivalent, then demonstrating that this relationship satisfies properties E1-E3. 
To illustrate the general pattern, let’s use the expression P(x, y) to represent a 
generic statement involving x and y. Notice that the two indeterminates x and y 
are ordered in P(x, y). 


Definition 3.6.4 (Sample) For x, y € S, define x = y provided P(x, y). 


Theorem 3.6.5 (Sample). The equivalence = in Definition 3.6.4 is an equiva- 
lence relation on S. 


Proof. We show that properties E1-E3 hold for = on S. 


(E1) Pick x ¢ S.Then...so that P(x, x). Thus x = x. 


(E2) Pick x, ye S and suppose x = y. Then P(x, y). Thus ..., so that P(y, x). 
Therefore, y = x. 


(E3) Pick x, y,z € S and suppose x = yand y = z. Then P(x, y) and P(y, z). Then 
...,80 that P(x, z). Thus x = z. 


Since = satisfies properties E1—-E3, we have that = defines an equivalence relation 
on S. 


Here is one more example in full detail. The expression =z is read “equivalence 
modulo Z.” 


Example 3.6.6 For real numbers x and y, define x =z y provided x — y is an 
integer. Show that =z defines an equivalence relation on the real numbers. 


Solution We show that =z satisfies properties E1—E3. 


(E1) Pick x € R. Then x — x = 0, which is an integer. Thus x =z x. 


(E2) Pick x, y €¢ R and suppose x =z y. Then x — y is an integer. Thus y — x = 
—(x — y) is also an integer, so that y =z x. 


94 


Chapter 3 Sets and Their Properties 


(E3) Pick x, y,z € R, and suppose x =z y and y =z z. Then x — y and y— z are 
both integers. Since the integers are closed under addition, we have that 
x-Z=(x*—y)+ (y—2® €Z. Thus x =7 z. 


Since =z satisfies properties E1—-E3, we have that =z is an equivalence relation 
on the real numbers. Mf 


If some inquisitive soul were to ask for some real numbers that are equivalent 
to 8.363 according to the definition in equivalence in Example 3.6.6, we could take 
the following approach. Since x =z y means that x — y is an integer, we know that 
x =z, 8.363 provided x — 8.363 is an integer. Now if your three favorite integers 
are, respectively, 15, —188, and 0, we would have 


x—8.363=15 x-—8363=-188 and x—8.363=0 (3.54) 


which would yield 23.363, —179.637, and 8.363. With practice, you might develop 
a quicker approach of your own. 


EXERCISE 3.6.7. Let S be the xy-plane, that is, S = {(x, y): x,y €R}. For 
two points (x1, y,) and (x2, y2), define the two points to be equivalent, writ- 
ten (x1, y1) = (42, y2), provided y2 — yy = 3(x2 — x1). Show that = defines an 
equivalence relation on S. Name three elements of S that are equivalent 
to (2, 1). 


EXERCISE 3.6.8 Let F be the family of all sets. Show that Definition 3.1.9 
defines an equivalence relation on F. 


To show that = is an equivalence relation on a set involves showing that all 
three properties E1-E3 are satisfied. Furthermore, each of these properties is a 
universal statement about the elements of the set. Thus to show that = is not an 
equivalence relation, we only need to show that one of the properties fails, and 
this demonstration will involve constructing a counterexample. 


Example 3.6.9 For points in the xy-plane, define (x1, y1) = (x2, y2) provided 
either xj = x2 Or yj = y2. Then = is not an equivalence relation because it is not 
transitive. For example, (1, 2) = (3, 2) and (3, 2) = (3, 4), but (1,2) # (3,4). 


EXERCISE 3.6.10 For integers m and n, define m =n provided |m —n| 41. 
Is = an equivalence relation on the whole numbers? Prove or disprove. 


EXERCISE 3.6.11 For positive integers m and n, define m = n provided mn > 2. 
Is = an equivalence relation? Prove or disprove. 


EXERCISE 3.6.12 Let S be the set of four-letter words in the Random House 
Unabridged Dictionary. For two words x, y € S define x = y if it is possible to 
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construct a sequence of words in S, beginning with x and changing one letter at a 
time to create another word in S at each step, ending up at y. (A sequence of one 
word is considered valid.) For example BABY = POOP because of the sequence 
BABY, BABE, BARE, BARN, BURN, BURP, BUMP, PUMP, POMP, POOP. Is 
= an equivalence relation on $? Prove or disprove. 


EXERCISE 3.6.13 Let F be the family of all sets. For sets A and B, define A = B 
provided either A C B or BC A. Is = an equivalence relation on F? Prove or 
disprove. 


EXERCISE 3.6.14 Suppose we have a square that sits in the xy-plane with its 
center at the origin and rotated so that its corners lie on the coordinate axes. 
Suppose also that we have written the numbers {1, 2,3, 4}, one on each corner, 
as in Figure 3.10. (Assume the numbers are visible on the front and back of 
the square. All we care about is which number is in each of the four possible 
positions.) Call one such writing of these numbers an assignment. For two assign- 
ments x and y, define x = y if it is possible to convert x into y by rotating the 
square around its center point and/or flipping the square over about the x-axis. 
Does = define an equivalence relation on the set of all assignments? Prove or 
disprove. 


EXERCISE 3.6.15 LetF be the family of all non-empty sets. For two sets A, B € 
F, define A = B provided AN Bis non-empty. Is = an equivalence relation on F? 
Prove or disprove. 


EXERCISE 3.6.16 Is <an equivalence relation on the real numbers? 


Figure 3.10 An assignment of numbers to the square. 
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EXERCISE 3.6.17 Is # an equivalence relation on the real numbers? 


Our next example of an equivalence relation is very important in abstract 
algebra, and it pervades Part III of this text. The symbol =¢ is read “equivalence 
modulo 6” (or mod 6). 


EXERCISE 3.6.18 For integers x and y, define x =¢ y provided there exists an 
integer k such that x — y = 6k. This definition says that x =. yif x — yisa multiple 
of 6. Then =¢ defines an equivalence relation on the integers. 


EXERCISE 3.6.19 Construct the set consisting of all integers that are equivalent 
(modulo 6) to 14. 


EXERCISE 3.6.20 What is the smallest nonnegative integer that is equivalent 
(mod 6) to —16? 


The number 6 in our definition of equivalence (mod 6) was chosen arbitrarily, 
of course. We can discuss =, for any positive integer n, where we define x =, y 
provided there exists an integer k such that x — y = nk. Notationally there are two 
other common ways of writing this form of equivalence among integers. They are 


x=y (modn) (3.55) 
x=y (n) (3.56) 


EXERCISE 3.6.21 What is the smallest nonnegative integer x such that 
1195 =11 x? 


EXERCISE 3.6.22 Suppose x is an integer such that x =9 65. What is the smallest 
nonnegative integer y such that x =9 y? 


Equivalence mod n has several important features that we investigate here. 
First, notice that 81 =7 25 because 81 — 25 = 56 = 8 x 7. Applying the division 
algorithm to a = 7 and by = 81, then toa = 7 and b2 = 25, we get 81 =7 x 11+4 
and 25 =7x3+4. So 81 and 25 have the same remainder when divided by 7 
according to the division algorithm. This illustrates the following theorem, which 
says merely that x and y differ by a multiple of n if and only if they have the same 
remainder when divided by n. 


EXERCISE 3.6.23 Suppose x and y are integers and n is a positive integer. Then 
x =, yif and only if x and y have the same remainder when divided by n according 
to the division algorithm. 


EXERCISE 3.6.24 Letnbea given positive integer. Then for every integer x, 
there exists y € {0,1,..., —1} such that x =, y. 
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EXERCISE 3.6.25 Suppose x and y are integers such that x =3 y 43 0. Then 
apie 20 
xy =3 1. 


Sometimes a mathematician doing research believes a certain theorem is true 
and can be proved, but he is mistaken. Often, he works his way through a proof 
only to find a hole in his reasoning, a gap that he cannot cross to arrive at the 
desired conclusion. Analyzing this gap might lead to the discovery of a counterex- 
ample to his supposed theorem, which in turn might also lead to a somewhat 
weaker theorem that can be proved for all cases except for those that are like the 
counterexample. 

In a similar way, textbook authors sometimes think they are devising a really 
cool exercise, only to discover a mistake that they would be embarrassed for 
their colleagues to know about. The last exercises of this section illustrate such 
a mistake that an unnamed author”? made, a principle discovered that generates 
counterexamples, and a subsequent weaker theorem that can be proved. 


EXERCISE 3.6.26 Let S = {(x, y): x, y € Z}, and define (x1, y1) = (x2, y2) pro- 
vided x1y2 = x21. Try to show that = defines an equivalence relation on S. 
Where does the proof break down? Can you use the obstacle to discover a 
counterexample to the claim? If not, look to the next exercise. 


EXERCISE 3.6.27 Show that transitivity fails in the definition of equivalence in 
Exercise 3.6.26.74 


EXERCISE 3.6.28 The set S in Exercise 3.6.26 can be redefined to eliminate 
the one problematic element noted in Exercise 3.6.27, so that the definition of 
equivalence will satisfy properties E1-E3. Redefine S appropriately, and then use 
your work from Exercise 3.6.26 and your new definition of S to prove that = is an 
equivalence relation on your newly defined set. 


Equivalence Classes and Partitions 


In Example 3.6.2, it might have occurred to you that defining two cities to be 
equivalent in terms of accessibility by roads takes all cities on earth and lumps 
them together into groups of cities that are mutually accessible from each other. 
Presumably, for example, there is a network of two-way roads connecting all cities 
on the north island of New Zealand. Thus it is possible to drive from any city on 
the island to any other. As a result, every city on the north island is equivalent to 


If x and y are not equivalent to 0 (mod 3), then what possibilities remain? 

I myself. 

There is one point that every element of S is equivalent to, but there do exist other points that are not 
equivalent to each other. 


98 


Chapter 3 Sets and Their Properties 


every city on the north island, including itself. Furthermore, no city outside the 
north island of New Zealand is equivalent to any city on that island. If that were to 
change, say by some newly constructed bridge between Auckland, New Zealand, 
and Santiago, Chile, then Santiago would become equivalent to all the cities on 
the north island of New Zealand. Furthermore, all cities accessible from Santiago 
would also become equivalent to all cities on the north island of New Zealand. It 
is sort of like the molecules in two drops of water. Either the two drops do not 
touch at all, or, if they do, they instantly merge into one drop. 

This illustrates that there is a lot of strength in the properties E1—E3. One very 
important feat that an equivalence relation performs is that it completely splits 
the set up into non-empty, nonoverlapping subsets. In general, the splitting of a set 
into nonoverlapping subsets is called partitioning. When an equivalence relation 
is defined on a set, it naturally partitions the set into subsets where elements of 
the same subset are all equivalent to each other, and any two elements of different 
subsets are not equivalent to each other. 

In this section, we will define partitions in general and construct some examples. 
Then we will see how an equivalence relation gives rise to a partition of the set on 
which it is defined. 


Definition 3.7.1 Suppose Sisaset and F = {A}isa family of subsets of S$. Then 
F is said to be a partition of S if 


(P1) Every A in F is non-empty; 
(P2) If A, Be F and AN Bis non-empty, then A = B; and 


Let’s illustrate with a specific example before we discuss some general prop- 
erties of partitions. Just remember, a partition of a given set S is nothing but a 
family of subsets of S with some special properties. 


Example 3.7.2 Consider the following four families of subsets of Njo. 


Fr, = {{1}, (2, 3,5, 7}, {4, 6, 8, 10}, (9}} 
Fo = {0, {1,2,3, 4}, (5, 6, 7, 8}, (9, 10}} 
(3.57) 
Fs = {{(1}, (2,4, 6, 8, 10}, {3, 6, 9}, {5, 10}, {7}} 
Fa = {{Primes in Njo}, {Composites in Njo}} 


Of these four families, only F; is a partition of Nip. Notice how F; satisfies 
properties P1—P3. First, no set in F; is empty. Second, they are all disjoint. If we 
choose any two of them and they have a non-empty intersection, then they are 
the same set. Third, forming the union across F; produces Nj. 
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Notice how the other families fail to be a partition of Nio. Since a partition of 
a set must contain only non-empty subsets, F2 fails to be a partition of Njo. In F3, 
there exist two distinct sets with non-empty intersection. Finally, the union across 
F4 is not Nig for 1 is neither prime nor composite. 


In the work we will do in this section, there will be some program by which we 
construct a family of subsets of a given set in order to create a partition. To verify 
that this family does indeed partition the set, we must show that the family has 
properties P1—P3. 


(P1) The program for forming the subsets of S must always generate non-empty 
sets. Thus if we choose an arbitrary set in the family, we must be able to find 
some element of S in the chosen set. 


(P2) Noelement of S can be in more than one distinct set in the family. Thus if we 
choose sets A and B in the family and suppose they have a common element 
(AN B #4 %), we must be able to show A = B. 


(P3) This property says that the generated sets in the family completely exhaust all 
elements of S. Naturally, the program for forming the family of subsets of S$ 
should be defined so that it creates only subsets of S. Then Exercise 3.3.13 
will guarantee that the C property P3 is satisfied. What remains to be shown 
is >, which amounts to showing that every element of S is in some set in the 
family. 


Example 3.7.3. From Example 3.6.2, create a subset of C in the following way. 
Choose a city x, and define A, to be the set of all cities on earth from which there 
are roads to city x. Now let 


F ={Ay:x€C} (3.58) 


the family of all these subsets of C constructed by letting x be every city on earth. 
Then F is a partition of C. 


Solution We verify that properties P1—P3 are satisfied on F. 


(P1) Pick any A € F. By the way sets in F are constructed, there exists x € C for 
which A = A,. Since city x has roads to itself, we have that x € A,so that A 
is non-empty. 


(P2) Pick A,, Ay € ¥, and suppose A, Ay # J. Then there exists z € A, Ay. 
We show that A, = Ay. First pick a € A,. Then it is possible to drive from 
city a to city x on roads. Now since z € A,, there are roads from z to x, so that 
it is also possible to drive from x to z. (Draw a picture!) Furthermore, since 
z € Ay, there are roads from z to y. Thus we may travela > x > z > y,and 
have that it is possible to drive from a to y. Thus a € Ay,so that A, C Ay. By 
an identical argument in the reverse direction A, C A,. Thus A, = Ay. 
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(P3) Since A, C C for all x € C, Exercise 3.3.13 implies that UrecA, C C. To 
show 2, pick any y € C. Now A, € F and ye Ay. Thus y € Uyec Ay, and 
CCUyecAy. 


Among the islands along the inside passage of southeast Alaska is Wrangell 
Island, whose only town, Wrangell, has a population of around 2500. Though there 
are roads between Wrangell and remote parts of the island, it is not possible to get 
to Wrangell on roads from any other city on earth. Thus there is only one element 
of the set AWrangenl, and that is Wrangell. In the construction of F in Eq. (3.58), we 
allowed x to take on the names of all cities on earth to generate all the A, in F. 
Of the vast number of cities on earth, each of which generated its own subset of 
C, Awrangell Was generated only once, when x = Wrangell. 

Contrast this with Aparas. Dallas is an element of A, for thousands of x € C 
because there are so many cities accessible by roads from Dallas. So in the con- 
struction of F, Dallas is placed into such sets as A, ybbocks ABantf» 20d AAmsterdam-~> 
However, in spite of this redundancy, ¥ partitions C because all the sets containing 
Dallas are the same. 


EXERCISE 3.7.4 In Example 3.6.6 we defined an equivalence relation on the 
real numbers by defining x =z y, provided x — y is an integer. For a given real 
number x, define the following subset of the real numbers: 


A,={yeR: y=zxs={veR: y-xeZ} (3.59) 
Create a family of subsets of real numbers by letting F = {A, : x € R}. 


(a) Describe A434 by listing some of its elements. 


(b) Show that F partitions the real numbers by showing it has properties 
P1-P3. 


Now let’s see how an arbitrary equivalence relation on a non-empty set S is 
tied to a partition of S. We will work our way into a theorem tying them together 
by way of some examples. First, a definition that presents standard mathematical 
notation for the sets we have thus far written as A,. 


Definition 3.7.5 Suppose = defines an equivalence relation on a non-empty set 
S. For an element x € S, define 


]={yeS:y=x) 


That is, [x] contains all elements of S that are equivalent to x. This subset of S is 
called the equivalence class of x. 


25 Amsterdam, Missouri, population 281. 
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Example 3.7.6 From Example 3.6.2, the equivalence class of Auckland is the 
set of all cities with roads leading to Auckland, that is, all cities on the north island 
of New Zealand. It is the same as the equivalence class of Wellington. 


Example 3.7.7 For equivalence mod 6 from Exercise 3.6.18, given any integer x, 
[Ix] ={yeZ:y—x=6k, forsomek € Z} 


Thus [x] is the set of all integers y of the form y = 6k + x, where k takes on all 
possible integer values. 


EXERCISE 3.7.8 From Example 3.7.7, describe [—4] by listing some of its ele- 
ments. How many distinct equivalence classes are there in this example? Describe 
all of them by listing some elements from each equivalence class. 


EXERCISE 3.7.9 From Example 3.6.14, how many assignments are in the 
equivalence class of the assignment in Figure 3.10? 


In all the preceding examples of equivalence relations, the equivalence classes 
seem to be a basis of a partition of the set. This is no coincidence, of course. 
Theorem 3.7.10 says that properties E1—-E3 can be used to demonstrate that the 
family of equivalence classes satisfies P1-P3. Most details are left to you as an 
exercise. 


Theorem 3.7.10 Suppose = defines an equivalence relation on a set S. Define 
F = {[x]: x € S}. That is, F is the collection of all equivalence classes generated 
by letting x take on all values in S. Then F is a partition of S. 


Proof. We verify that properties P1-P3 from Definition 3.7.1 are satisfied. 


(P1) Pick any [x] € F. Since x = x, it follows that x € [x]. Thus [x] is not empty. 
(P2) Pick [x], Ly] € F and suppose [x] N[y] isnot empty .... 


(P3) Since every set in F is defined to be a subset of S, we have Ux<s[x] € S by 
Exercise 3.3.13. So we must show D>. Pick ye S.... 


EXERCISE 3.7.11 Complete parts P1—P3 of the proof of Theorem 3.7.10. 


Theorem 3.7.10 is very important for the following reason. If some form of 
equivalence is defined on a set, and it can be verified that this definition is an 
equivalence relation, then Theorem 3.7.10 guarantees that the set is automati- 
cally partitioned into equivalence classes. We can then think of any element of 
a particular equivalence class as being representative of all elements in its class. 
Furthermore, we can think of this equivalence in precisely the same way we think 
of equality, where two equivalent elements are, in many senses, interchangeable. 
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The partitioning of a set into equivalence classes lumps the elements of the set 
into categories where one element can be replaced with any other element in its 
category, within limits, of course. In Section 3.8 we will illustrate this phenomenon 
on the rational numbers, where we investigate the familiar definition of equality 
of two fractions. Then we will show how the binary operations of addition and 
multiplication of rational numbers are defined so that different representative 
elements of equivalence classes are indeed interchangeable. 


Building the Rational Numbers 


As an illustration of the concept of equivalence relation, we want to take a look 
at equality, addition, and multiplication in the integers and see how they give rise 
to equality, addition, and multiplication in the rationals. This section effectively 
dissects of some of the real number properties assumed in Chapter 0 and proved 
in Chapter 2. Thus a few prefatory words are in order about what we are trying to 
accomplish here. 

First, consider the whole numbers. Beginning with the set {0,1}, which by 
assumption A15 is indeed a two-element set, we calculate all possible sums of its 
elements. We know from the behavior of 0 thatO +0 =Oand1+0=0+4+1=1. 
What is not immediately clear is 1+ 1. Is it possible that 1+1=0o0r1+1= 
1? Certain assumptions and properties of real numbers allow you to prove that 
neither of these is possible. 


EXERCISE 3.8.1 Show that 1+1=0and1+1=1are impossible.”° 


Since 1 + 1 is neither 0 nor 1, it might be more efficient to create a new symbol, 
like 2 perhaps, to denote 1 + 1. This is just the beginning of a process of building 
the whole numbers as a set of successors of previously defined whole numbers. 
The assumptions used to build them are called Peano’s postulates. Because they 
are formulated without reference to the set of real numbers as a context, Peano’s 
postulates and their implications generate the set of whole numbers in a somewhat 
more complicated way than we describe here. One of the assumptions is that zero 
is not a successor to any whole number, so that the sequence of successors, which 
we call 1,2,3,...,never circles back around to zero. 

Once the set of whole numbers has been constructed, we extend it to the 
integers by tossing in the additive inverses of all the whole numbers. These additive 
inverses are indeed not whole numbers themselves, except for —0. The reason is 
that if any positive whole number a had a whole number b as its additive inverse, 
then a + b = 0 would imply that 0 is the successor toa +b —1. 

In this section, we want to give ourselves the standard assumptions about equal- 
ity and binary operations as applied only to the integers, as we did in Chapter 0 


26 Consider the trichotomy law and cancellation of addition. 
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for the real numbers. Specifically, let’s make the following assumptions about the 
integers, stated in a form very condensed from that in Chapter 0. 


(Z1) Integer equality is an equivalence relation and therefore has properties 
E1-E3. 


(Z2) Integer addition and multiplication are well defined, closed, associative, 
and commutative binary operations. Furthermore, the integers contain the 
additive and multiplicative identities, the integers are closed under additive 
inverses, and multiplication distributes over addition. 


(Z3) The positive integers are closed under addition and multiplication. 


(Z4) All relevant results from Chapter 2, as derived from the previous assump- 
tions, apply to the integers. 


In what follows, we want to create the rational numbers from scratch, using 
the integers as building blocks.”’ First we construct the set of rationals; then we 
define a notion of equivalence on them using integer equality as our criterion, and 
this definition will coincide with the way we already think of rational equality. 
Then we will define addition and multiplication on the rationals and show that 
they are well-defined operations. Throughout this discussion, we will use the term 
equality exclusively with reference to integers, and equivalence when we refer to 
the similar notion we are defining on the rational numbers. 


Defining Rational Equality 


In Chapter 0, we defined the set of rational numbers as 


Q={p/q:p.qg€Z,q 40} (3.60) 


We normally think of the expression p/q as meaning “p divided by gq.” For now, we 
want to dispense with this meaning and conceive of rational numbers as merely 
ordered pairs of integers, where the second integer is not allowed to be zero. In 
this discussion, the forward slash in p/q is intended to have no meaning, except 
that it separates the first integer from the second. 

There is nothing inherent in the definition of the rationals in Eq. (3.60) that 
suggests how you would address the equivalence of two of its elements. In fact, 
there is more than one useful way to define equivalence on the rational numbers, 
but we will look only at the familiar form that coincides with our notion of rational 
equality. Note that in Definition 3.8.2, we refer to integer equality as =, and we 


In a more abstract algebraic setting, this would be called constructing the field of quotients of an 
integral domain. 
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use juxtaposition to represent integer multiplication. The equivalence denoted = 
is the symbol we are defining on the rationals. 


Definition 3.8.2 For two rational numbers p/g and r/s, we define p/q =r/s 
provided ps = qr. 


Notice the positions of p,q, r,s in Definition 3.8.2. These positions must be 
retained when we translate between p/q = r/s and ps = qr. 


EXERCISE 3.8.3 Answer the following questions as they relate to Defini- 
tion 3.8.2. All alphabetic symbols represent integers. 


(a) The statement a/m = b/n corresponds to what integer equation? 
(b) The statement bc = de translates into what rational equivalence statement? 


(c) Is 6/(—15) = (—4)/10? Why or why not? 


EXERCISE 3.8.4 Assuming properties Z1—Z4 on the integers, show that Defi- 
nition 3.8.2 defines an equivalence relation on the rational numbers. 


Thanks to Exercise 3.8.4, it follows from Theorem 3.7.10 that the rational 
numbers are partitioned into equivalence classes where p/g and r/s are in the 
same equivalence class provided ps = qr. You already have an idea of what these 
equivalence classes look like. 


EXERCISE 3.8.5 Prove the following about the equivalence classes of rational 
numbers. 


(a) For any integer r and nonzero integers q and s,0/q =r/s if and only if r = 0. 


(b) For any rational number p/q and any nonzero integer s, p/q = ps/qs. 


By Exercise 3.8.5, all rational numbers of the form 0/q are equivalent to each 
other. Furthermore, if a/b is rational, a is nonzero, and a and b have a common 
integer factor s, then we may write a = ps and b = qs to have that a/b = p/q. This 
corresponds to the familiar notion that we may cancel out a common factor of 
a numerator and denominator and have an equivalent rational number. We may 
then address an equivalence class of rational numbers by referring to an expression 
of the form p/q, where p and q are assumed to be relatively prime. 


Rational Addition and Multiplication 


Now that we have an equivalence relation defined on the rational numbers, we 
can define a form of addition of rational numbers. We will denote this by @ to 
distinguish it from the integer addition we use to define it. 
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Definition 3.8.6 Given two rational numbers p/q and r/s, we define addition ® 
in the following way. 


p/q ®r/s = (ps + qr)/qs (3.61) 


Observe that Eq. (3.61) guarantees that @ is closed on the rationals. Since 
integer addition and multiplication are closed, then ps + qr is also an integer, and 
the fact that g and s are nonzero integers implies that qs is also a nonzero integer. 
Thus (ps + qr)/qs is indeed a rational number. 

Now we must show that addition in the rational numbers is well defined. Notice 
in the next exercise how that the equations use rational equivalence (=) and 
addition @. When you translate them into their equivalent integer form, you will 
use standard integer equality, addition, and multiplication. 


EXERCISE 3.8.7 Addition of rational numbers as defined in Eq. (3.61) is well 
defined. That is, if p/q,r/s, t/u, v/w are rational numbers, and if p/g =r/s and 
t/u =v/w, then p/q ® t/u=r/s ®v/w. 


EXERCISE 3.8.8 Show that @ is commutative. 


Notice from Eq. (3.61) that anything from the equivalence class of 0/s functions 
as an additive identity in the rationals, for 


p/q ® 0/s = (ps + q- 0)/qs = ps/qs = p/q (3.62) 
Also, notice (— p)/q and p/(—gq), which are equivalent, function as —(p/q). 


Definition 3.8.9 Given two rational numbers p/q and r/s, we define multipli- 
cation ® in the following way. 


(p/q) ® (r/s) = pr/gs (3.63) 


Clearly, multiplication is closed and commutative. Proving multiplication is well 
defined is easier than showing addition is well defined. 


EXERCISE 3.8.10 Multiplication of rational numbers as defined in Eq. (3.63) 
is well defined. That is, if p/q, r/s, t/u, v/w are rational numbers, and if p/g = r/s 
and t/u = v/w, then (p/q) ® (t/u) = (7/s) ® (v/w). 


Notice that any rational number of the form p/p functions as a multiplicative 
identity, and that all rational numbers of this form are in the same equivalence 
class. Also, recall that a rational number is an additive identity if and only if it is of 
the form 0/q. Thus if p/q is rational and p 0, then a multiplicative inverse of p/q 
exists in the rationals. For (p/q) ® (qg/p) = (pq)/(gp) = (pq)/(pq). Furthermore, 
any rational number in the equivalence class of g/p functions as a multiplicative 
inverse of p/q. 
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Roots of Real Numbers 


Up to this point, we have made no reference to assumptions A20-A22 from 
Chapter 0. In this section, we want to explore A22: 


(A22) For every positive real number x and any positive integer n, there exists 
areal solution y to the equation y” = x. Such a solution y is called an nth 
root of x. 


You might wonder why assumption A22 refers to the solution of the equa- 
tion y” = x in order to address roots of real numbers instead of using the familiar 
expression %/x. To do otherwise is to get the cart before the horse. The expres- 
sion 2/x derives its meaning from solutions y to the equation y" = x. Whether 
such solutions exist, and how many such solutions there are must be derived 
from A22 and other properties of real numbers that we have proved so far. In 
some cases, a solution of y” = x will not exist, and we will not be able to define 
/x in these cases. In other cases, multiple solutions will exist, so that 2/x would 
be ambiguous. Thus we do not want to use the expression 4/x until we are cer- 
tain of two things. First, we must be certain that such a real number exists, and 
second, in case there is more than one real solution to the equation y” = x, we 
agree on which solution we will always intend. By the end of this section, the 
expression %/x will be clearly and uniquely defined for appropriate values of x 
and n. 

Another point to make here is that assumption A272 is not a standard axiomatic 
assumption in a rigorous study of the real numbers. It actually follows from the 
Least Upper Bound axiom. In this text, we accept it without question. Here is the 
main theorem we want to prove in this section. 


Theorem 3.9.1 Let x be any real number and n a positive integer. Then the 
following hold. 


(X1) The equation y” = 0 has the unique solution y = 0. 


(X2) Concerning even roots: 


(a) If x > 0, the equation y*” = x has precisely two distinct real number 
solutions, and these are additive inverses of each other. 


(b) If x < 0, the equation y*” = x has no real number solution. 


(X3) Concerning odd roots, the equation y2”+! 


= x has a unique solution. 

Property X1 should be clear, for certainly 0” = 0, and if y” = 0, then y=0 
from the principle of zero products. The following exercise will lead you sys- 
tematically through the proof of claims X2—X3. Remember, A22 states that 
that for every x > 0 and positive integer n, the equation y” = x has some real 
solution y. 
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EXERCISE 3.9.2 Prove Theorem 3.9.1 by demonstrating the following. In all 
parts, n is a positive integer. 


(a) Existence in X2(a): Let x > 0, and suppose yo is a real number solution to 
y*" = x. (Notice yo 4 0.) Show that —yo is also a solution to y2” = x.78 


(b) Nonexistence in X2(b): Suppose x < 0. Explain why there is no real number 
y such that y2” = 


(c) Existence in X3 for x < 0: Suppose x < 0. Prove that there exists a real 
number solution y to y2"+! = x29 


(d) Nonexistence in X3 of solutions of opposite sign: Suppose yo is a solution to 
y-n+l — x, Prove that x > 0 if and only if yo > 0. 


(e) Uniqueness in X2(a) and X3 (positive case): Let x > 0. If y) > 0 and y2 > 0 
satisfy yj = x and y; = x, then yj = io" 


(f) Diniguencss in X3 (negative case): Let x < 0. If y; <0 and y2 < 0 satisfy 
Qntl _ Qn+1 _ i, St 
yy = xand y,”" =x, then yj = yp. 


With Theorem 3.9.1, we can now introduce the notation %/x and define it 
unambiguously. 


Definition 3.9.3 If x is a real number and n is a positive integer, then *"\/x 
is defined to be the unique solution y of the equation y*”+! = x. If x > 0, the 
expression %/x is defined to be the unique, nonnegative solution y of the equation 


yn =x. 


Theorem 3.9.1 and Definition 3.9.3 lead us to the following principles of alge- 
braic manipulation. For x > 0, y*” = x if and only if y = + 2/x. Similarly, for all 
real numbers x, y*”+! = x if and only if y= 7"/x. 


EXERCISE 3.9.4 Suppose x, y > 0. Show that if x < y, then /x < /y.? 


Irrational Numbers 


Accompanied by several axioms we have not discussed in this text, the set {0, 1} 
gives rise to the whole numbers, which is the smallest superset of {0, 1} that is 
closed under addition and multiplication. However, the whole numbers are not 
closed under additive inverses, so we can expand them to include these additive 


28 Exercise 3.5.6 should come in handy. 

29 Use the fact that —x > 0. 

30 Use the factorization formula in Exercise 3.5.11. 
31 Apply part (e) to —yy and —y. 


Exercise 2.2.6(a) should make this quick. 
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inverses, arriving at the integers. The integers are not closed under multiplicative 
inverses, so with a program like that in Section 3.8, we extend the integers to the 
rationals, as they were defined in Eq. (3.60). 

The extension of the whole numbers to the integers is straightforward, in that it 
merely involves tossing in {—1, —2, —3, ...}, where these additive inverses are sim- 
ply declared to exhibit the behavior of additive inverses. However, the extension 
of the integers to the rationals is a different sort of program. The set of rational 
numbers is not a superset of the integers in the same way that the integers repre- 
sent a superset of the whole numbers. Rational numbers are conceived as integer 
pairs p/q, where q is nonzero, and rational equality, addition, and multiplication 
are created from scratch, using equality, addition, and multiplication of integers 
as ingredients. 

By creating the rational numbers from the integers as we did, we say that the 
integers can be isomorphically embedded in the rationals. Given an integer p, we 
may replace p with the rational expression p/1. We then can see that rational 
addition of these replacement forms behaves in a way that corresponds to integer 
addition: 


pPlegil=(p-141-9/d-)=(p+q/1 


where (p + q)/1 corresponds to p+ q in the integers. A similar result holds for 
multiplication: 


(p/1) ® (4/1) = (pq)/.- 1) = (pq)/1 


The rational numbers therefore comprise a smallest possible extension of {0, 1} 
that has closure of addition and multiplication, existence of additive and multi- 
plicative identities, and closure under additive and multiplicative inverses. In other 
words, the rationals are the most efficient possible extension of {0, 1} that satisfies 
assumptions A1l—A19. 

Up to this point, we have said virtually nothing about irrational numbers— 
real numbers that are not rational. You have worked with lots of numbers that 
are irrational: z, e, and most numbers of the form +/x. You have also been taught 
that irrational numbers have a decimal representation that does not terminate 
and does not fall into a pattern of repetition. However, you might never have seen 
a definitive argument that any particular number is irrational. In this section we 
address the existence of irrational number. But first, a little history. 

Some of the intuitive properties of the real numbers that we likely take for 
granted were not a part of the thinking of the ancient Greeks, most notably the 
Pythagoreans, that very secret order of thinkers who produced some amazingly 
sophisticated mathematics. One such intuitive feel we probably have is that the 
set of real numbers is a sort of continuum, a set that can be visualized in terms of 
the ordered points on a straight line, where rationals and irrationals are spread up 
and down like strewn grains of salt and pepper, leaving no holes in the continuum. 
The Greeks had no concept of irrational numbers at first, and now is a good time 
to touch briefly on some of their views. 
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To the Greeks, numbers were conceived in terms of the geometric con- 
cepts of length, area, and volume, all of which were constructible according 
to certain idealized rules. They had very sophisticated techniques for imagining 
their idealized construction using the only two geometric shapes they considered 
perfect: straight line segments (crudely constructible with a straight-edge) and 
circles (crudely constructible with a compass). In his all-important compilation 
of the best of Greek mathematics (a five-volume set called Elements), Euclid 
demonstrated amazingly sophisticated mathematical results, such as the theo- 
rem attributed to Pythagoras, using only some assumptions about circles and line 
segments. 

With these techniques of construction, it is possible to imagine the drawing, 
if you will, of a line segment whose length is any positive rational number in the 
following way. Beginning with a line segment whose length is arbitrarily declared 
to be one unit, it is possible to construct segments representing 2,3, 4, and so on, by 
extending the given segment with a straightedge, then twirling a compass around 
to tack the measured unit length onto itself end to end. It is also pretty easy to 
take a segment of length n, and use similar triangles to construct a line segment 
of length 1/n. (See Figure 3.11.) With another technique by which segments of 
length a and b can be used to construct a segment of length ab, all the positive 
rational lengths are constructible. 


EXERCISE 3.10.1 Devise a technique similar to that described in Figure 3.11 
by which a segment of length ab can be constructed from segments of length a 
and b. 


The fact that segments of arbitrarily long integer length n could be constructed 
meant that segments of arbitrarily short length 1/n could also be constructed 
by the reciprocation technique. Such a short line segment could then, in the- 
ory, be added to itself as many times as needed to produce a segment of any 
arbitrary length m/n. The Greeks erroneously assumed the converse—that a 
segment of any constructible length could be constructed by imagining it to 
be a finite sum of very short segments of the form 1/n. Slap any segment 


Construct segment AB of length n. 
Construct segment AC of length 1. 
Construct segment BC. 

Construct segment FE parallel to BC. 
Segment AE has length 1/n. 


OO 


yn 


Figure 3.11 Constructing length 1/n using similar triangles. 
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down onto the imaginary paper using techniques of construction. How long is 
it? Some whole multiple of a (possibly very short) segment that can be con- 
structed by reciprocating a segment of positive integer length n. In modern 
language, what number is represented by the length of a segment? It is always 
a rational one. 

Now suppose we are given two segments of arbitrary lengths. If both of them 
can be visualized in this way, what sort of relationship must exist between these 
two segments? In the same way that we would find a common denominator of two 
fractions m/n and p/q, they would say that there is a single segment, perhaps very 
short, that can be attached to itself a finite number of times to produce each of 
the two given segments. In our language, 1/nq can be added to itself mq times to 
produce m/n and np times to produce p/q. This assumed relationship between two 
segments is called commensurability. The Greeks believed that all constructible 
segments are commensurable, which is equivalent to our believing that all real 
numbers are rational. 

There was a problem with this, and the Greeks eventually figured it out. Take 
a segment of length one, then construct another segment of length one at one 
endpoint and at a right angle to this first segment. Then sketch the hypotenuse. 
This hypotenuse line segment is obviously constructible. We just described how 
to do it easily. Furthermore, by the Pythagorean theorem, its length d satisfies 
d* = 2.The problem is that it is not commensurable with some other constructible 
segments. That is, d is not rational. This is the amazing result that the Pythagoreans 
discovered, and it created no small crisis. Given that the theorem traditionally 
named after Pythagoras was already known, it boiled down to this fact: Since “the 
sum of the [areas of the] squares on the legs of a right triangle is equal to the [area 
of the] square on the hypotenuse” (see Fig. 3.12), then, as we would write it, /2 is a 
constructible length. Therefore, /2 is commensurable with 1, or, in our language, 
./2 must be writable in the form p/g, where p and q are integers and q is nonzero. 
The traditional proof that this is impossible is attributed to Aristotle. And, by the 
way, it is the one you will discover in the following exercise, with a few hints. 


Figure 3.12 A sketch of the Pythagorean theorem. 
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EXERCISE 3.10.2 2 is irrational.33-34-35.36.37 


A few words are in order about irrational numbers. Let’s work backwards from 
Exercise 3.10.2. Exercise 3.10.2 says V2 is not rational, while Theorem 3.9.1 says 
/2 is real. Thus there do, in fact, exist real numbers that are irrational. But 
Theorem 3.9.1 is based on assumption A22, which we have said very little about. 
As we said, assumption A22 is not an axiom of the real numbers. It can be proved 
from the Least Upper Bound axiom, which is a standard axiom of the real numbers. 
So what we discover is that the irrational numbers owe their existence to the Least 
Upper Bound axiom. 


EXERCISE 3.10.3 The proof of Exercise 3.10.2 can be easily generalized to 
demonstrate that ./p is irrational for any prime number p. Explain how your 
proof of Exercise 3.10.2 can be adapted into a proof of this more general claim.*® 


EXERCISE 3.10.4 The sum of a rational and an irrational is irrational. 


EXERCISE 3.10.5 The product of a nonzero rational and an irrational is 
irrational. 


EXERCISE 3.10.6 Let a,b,c,andd be rational numbers, and suppose a + b/2 = 
c + dV2. Does it follow that a = c and b = d?“° 


Relations in General 


Equivalence relations are just one example of the more general mathematical 
idea of a relation. A relation is a set construction that puts all kinds of ele- 
ment comparisons such as equality, less than, divisibility, and subset inclusion 
into one mathematical idea. It is also a way of linking elements of two different 
sets together, and it is a context in which functions can be defined. In this section, 
we define a relation and look at examples and special kinds of relations. But first, 
we define the Cartesian product of two sets A and B by 


Ax B={(a,b):a€A,be B} (3.64) 


Naturally, you will want to assume V2 is rational and arrive at a contradiction. 
Write /2 = m/n, where you can assume no common factors between m and n. 
Square both sides and look at the contrapositive of Corollary 2.4.5 . 

If m2 is even then, .... What does this mean? Cancel out a 2. 

So n? is even. What contradiction does this cause? 

Corollary 2.5.12 might help. 

See Exercise 1.2.18(i). 

If b Ad, then what must be true of 2? 
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the set of ordered pairs where the first term in the pair is an element of A and the 
second is an element of B. A relation from A to B is defined to be any subset of 


Ax B. 


Example 3.11.1. For A= {1,2,3} and B= {11, 12, 13, 21, 22, 23, 31, 32, 33}, 
the set 


R= {(, 11), @, 21), (2, 22), (3, 31), (3, 32), (3, 33)} (3.65) 


isarelationfrom Ato B. Hi 


In this section, we are going to delve only into subsets of A x A, which we call 
a relation on A instead of a relation from A to A. 


Example 3.11.2 Let A = {1, 2,3, 4,5, 6}. Then 


R= {d,3), 0,5), 2, 4), (2, 6), G, 5), (4, 6} (3.66) 


isarelationon A. Hi 


You might have seen the Cartesian plane written as R x R, or more succinctly 
as R?. If R is a relation on the real numbers, we can represent it graphically as a 
set of points in the xy-plane. 


Example 3.11.3 {(x, y): y>|x|+1} is a relation on the real numbers 
(Fig.3.13). I 


EXERCISE 3.11.4 For each of the following relations on the real numbers, 
sketch the set of included points in the xy-plane. 


Figure 3.13 The relation y > |x| +1. 
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(a) {(x, y):x < y} 
(b) {@, y) x? +y <4} 
(c) {@, y): |x| < 
(d) {a yirty>]} 


You might think there is nothing particularly exciting about a relation, since 
any collection of ordered element pairs qualifies as a relation. What makes the 
idea take shape and become mathematically important is that we can lay out 
certain criteria by which pairs are included in the relation, and these differ- 
ent criteria might have particular properties that make interesting statements 
about A. 


Example 3.11.5 Define a relation RC Q x Q by R= {(p/q,r/s) : ps = qr}. 
This relation consists of all pairs of rational numbers that are equivalent, as was 
defined in Definition 3.8.2. So (3/8, —30/—80) and (0/2,0/12) are in R, while 
(2/5,5/3) isnot. Hf 


Example 3.11.6 R; = {(a,b): a <b} and R2 = {(a,b): a| b} are relations on 
the integers) 


Example 3.11.7 Foraset A,# and A x Aarerelationson A. Mf 


The power set of a set A is defined to be the family of all subsets of A and is 
written P(A). For example, if A = {1, 2, 3}, the power set of A is 


P(A) = {@, {1}, {2}, {3}, {1, 2}, (1, 3}, (2, 3}, {1, 2, 3}} (3.67) 


EXERCISE 3.11.8 List all elements of the power set of the following sets. 


(a) @ 

(b) {1 

(c) {1,2} 

(d) {1,2,3,4} 


Example 3.11.9 For A = {1, 2, 3}, define a relation on the power set of A by R = 
{(A1, A2) : Ay © Az}. Thus ({1}, {1}) and (, A) are in R, but (A, {2}) and ({1, 2}, 0) 
are not. 

In the previous examples, a particular pair (x, y) is included in the relation 
if some stated property P(x, y) is true. That is, (x, y) € R if x and y are related 
according to some criterion. We give names to some relations when the criterion 
for inclusion of ordered pairs in R has certain properties. 
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Definition 3.11.10 Suppose R C A x A has the following properties. 


(E1) For all x € A, (x, x) € R. (Reflexive) 
(E2) For all x, y € A, if (x, y) € R, then (y, x) € R. (Symmetric) 
(E3) For all x, y,z € A,if (, y), (y, z) € R, then (x, z) € R. (Transitive) 


Then R is called an equivalence relation on A. 


Notice how E1-E3 here say the same things as E1—-E3 from Definition 3.6.1, but 
in different language. Instead of saying that x = x for all x € A is a true statement, 
we say that all ordered pairs of the form (x, x) are in the relation. Instead of saying 
that the truth of x = y implies the truth of y = x, now we say that the inclusion 
of the ordered pair (x, y) in the relation is always accompanied by the inclusion 
of (y, x). And instead of saying x = y and y = z implies x = z, we say that the 
presence of (x, y) and (y, z) in the relation is always accompanied by the presence 
of (x, z). Thus an equivalence relation on A is a subset of A x A with some special 
properties that motivate a partition of A. 

Anytime we define a term that describes how elements of a set may or may 
not compare to each other, we can use that basis of comparison as a criterion for 
inclusion in a relation. For example, the symbol < denotes a way that two real num- 
bers relate or compare to each other. In Exercise 3.11.4(a), the statement x < y 
is a criterion for the inclusion of a point (x, y) in a relation. The relation defined 
by < has some special properties that motivate a definition of another type of 
relation. Since it is common to write xRy as a quicker way to write (x, y) € R, 
and to say that x is related to y instead of saying (x, y) is an element of the rela- 
tion, we use this somewhat more efficient notation in the next definition and 
example. 


Definition 3.11.11 Suppose R is a relation on a set A with the following 
properties. 


(O1) For all x € A, xRx. (Reflexive) 
(O2) For all x, y € A,if xRy and yRx, then x = y. (Antisymmetric) 
(O3) For all x, y,z € A,ifxRy and yRz, then xRz. (Transitive) 


Then R is called an order relation on A, or a partial ordering of A. 


Example 3.11.12 Show that the relation in Exercise 3.11.4(a) is an order 
relation on the real numbers. 

Solution We show that R has properties O1-O3. 

(O1) Since x = x for all x € R, we have that x < x,so that xRx for all x. 


(O2) Suppose xRy and yRx. Then x < y and y < x,so that x = y. 
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(03) Suppose xRy and yRz. Then x < y and y < z. Nowif x= y, then z—x= 
z—y=0,so that x < z. Similarly, if y = z, we have that x < z. Ifx < y and 
y < z, then x < z by Exercise 2.2.5(e). In any case, xRz. 


Since R has properties O1-O3, it defines an order relation on the real 


numbers. 


We can be even more efficient with our language and notation than we were in 
Example 3.11.12 by dispensing with the statement xRy and saying simply that < 
defines an order relation on the real numbers. That would make the verification 
above look like this. 


Solution We show that < has properties O1-O3. 


(O1) For all x € R,x =xso that x < x. 
(O02) Ifx < yandy <x, thenx= y. 


(O3) Suppose x < y and y <z. If x= y, then z—x=z—y>0, so that x < z. 
Similarly, if y = z, we have that x < z. If x < y and y < z, then x < z by 
Exercise 2.2.5(e). In any case, x < z. 


Since < has properties O1-O3, it defines an order relation on the real 
numbers. & 


Verifying the claim in the next exercise will be quick if you will reference the 
applicable results from Section 3.2. 


EXERCISE 3.11.13 If Ais any set, then C defines an order relation on P(A). 
For the set {1, 2, 3}, the order relation defined by © can be illustrated with a 


directed graph as in Figure 3.14. The arrow from {1} to {1,2} indicates that the 
former is related to the latter by C. Notice that {1, 2, 3} is reachable from {1} by 


{1,2,3} 
{1,2} {1,3} {2,3} 
Raa 
{1} {2} {3} 
| 
{} 


Figure 3.14 Directed graph of the partial order relation C on P({1, 2, 3}). 
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a directed path by way of either {1, 2} or {1, 3}, and either of these directed paths 
indicates that {1} C {1, 2, 3}. 


EXERCISE 3.11.14 Divisibility defines an order relation on the positive 
integers. 


EXERCISE 3.11.15 Sketch a directed graph to illustrate the order relation of 
divisibility on {1, 2,3, 4, 6, 8, 9, 12, 18, 24, 27, 36, 54}. 


Another type of relation is reminiscent of <. We use the expression xRy to 
mean (x, y) ¢ R. 


Definition 3.11.16 Suppose R is a relation on a set A with the following 
properties. 


(S1) For all x € A,xRx. (Irreflexive) 
(S2) For all x, ye A, if xRy, then yRx. (Asymmetric) 
(S3) For all x, y,z € A,ifxRy and yRz, then xRz. (Transitive) 


Then R is called a strict order relation on A, or a strict ordering of A. 


Example 3.11.17 Show that < defines a strict order relation on the integers. 
Solution We show that < satisfies properties S1-S3. 


(S1) Since x — x = 0, we have that x — x 4 0,sox & x for all integers x. 
(S2) Suppose x < y. Then y—x > 0,so that y—x 4 0. Thus y 4 x. 
(S3) By Exercise 2.2.5(e),ifx < yandy <z,thenx<z. Hf 


EXERCISE 3.11.18 We may use the results of Example 3.11.17 to define a rela- 
tion on the rational numbers. In this exercise, if p/q and r/s are rational numbers, 
we assume that g and s are both positive. Define <g to be a relation on the ratio- 
nal numbers, where p/q <q r/s provided the integer inequality ps < qr is satisfied. 
Use the fact that < is a strict order relation on the integers to show that <g is a 
strict order relation on the rationals. 


EXERCISE 3.11.19 For any set A, C defines a strict order relation on P(A). 


One notable difference between < on the integers and divisibility on the pos- 
itive integers is that any pair of integers we choose is comparable in one way or 
the other by <. That is, for all integers a and b, either a < b or b < a. However, 
for divisibility, there are many pairs of positive integers that are not comparable 
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at all. For example, neither 6 | 10 nor 10 | 6 is true. This distinction motivates yet 
another type of relation. 


Definition 3.11.20 Suppose R is a relation on a set A with the following 
properties. 


(T1) For all x € A,xRx. (Reflexive) 
(T2) For all x, y € A,ifxRy and yRx, then x = y. (Antisymmetric) 
(T3) For all x, y,z € A,ifxRy and yRz, then xRz. (Transitive) 
(T4) For all x, y € A, either xRy or yRx. (Totality) 


Then R is called a total order relation on A, or a total ordering of A. 


Notice that properties T1-T3 are the same as O1—O3, so a total order relation 
is a special kind of order relation where every pair of elements is comparable in 
one way or another. 


Example 3.11.21 The relation < is a total ordering of the realnumbers. M& 


Example 3.11.22 For A = {1, 2, 3}, C does not define a total order of P(A), for 
{1,2} ¢ {2, 3} and {2,3} Z {1,2}. i 


One of our assumptions from Chapter 0 is the well-ordering principle. To say 
that ais the smallest element of a non-empty subset of the whole numbers is to say 
that a < x for all x in the set. The last type of relation we define is a generalization 
of the well-ordering principle. 


Definition 3.11.23 Suppose R is a relation on a set A with the following 
properties. 


(W1) For allx € A,xRx. (Reflexive) 
(W2) For allx, y € A,ifxRy and yRx, then x = y. (Antisymmetric) 
(W3) For all x, y,z € A, if xRy and yRz, then xRz. (Transitive) 


(W4) If Sis any non-empty subset of A, then there exists a € S such that aRx for 
allx € S. (Well ordering) 


Then R is called a well order relation on A, or a well ordering of A. 


Because properties W1—W3 are the same as O1—O3, a well ordering of A is an 
order relation. The feature that property W4 adds is that every non-empty subset 
of A has what we might call a least element. 
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EXERCISE 3.11.24 The least element of a well-ordered set is unique. 


EXERCISE 3.11.25 Show that a well ordering is a total ordering by showing 
that property W4 implies T4. 


Though a well ordering is a total ordering, a total ordering is not necessarily a 
well ordering. For example, < is a total ordering of the integers that is not a well 
ordering, for the entire set of integers does not contain a least element. Since W4 
implies T4, but not vice versa, property W4 is therefore stronger than T4. 

Just because a given a set with an order relation fails to be a well ordering, 
it does not mean that the set cannot be well ordered by some other relation. In 
fact, one of the most notable results in the modern theory of sets is that any set 
can be well ordered. Ernst Zermelo demonstrated this in 1904, using an axiom of 
set theory that we have said nothing about so far in this text. The axiom of choice 
is a somewhat mysterious axiom of set theory that is simple to state but seldom 
addressed at this level of the mathematical game. It says, “Given any family F of 
mutually disjoint non-empty sets, there is a set S that contains a single element 
from each set in ¥.” Thus S can be thought of as the result of having chosen a 
distinct representative element from each set in F. In the axiom of choice, there is 
enough strength to demonstrate that for any set, there exists a relation on A x A 
that is a well ordering of A. One way to see how the integers can be well ordered 
is to list its elements as (0, —1, 1, —2, 2, —3,3, —4, 4, ...) and define xRy if x does 
not come after y in this listing. When the integers are ordered in this way, every 
non-empty subset has a least element. 


4.1 


Functions 


Second only to sets, functions are likely the most important mathematical concept. 
Interestingly, functions can be defined solely in terms of sets, and some authors 
take that very formal approach. In this chapter, we define functions in a somewhat 
more informal way, and then we study some of the most important basic results 
about functions. 


Definition and Examples 


In Section 3.11 we defined a relation from A to B as any set of ordered pairs (x, y), 
where x € Aand y € B. Amapping from A to Bis another term for relation, though 
it conjures up the imagery that, given a particular element of A, some process or 
computation is done on it to pair it with an element or elements of B. We use this 
input-output imagery to define a function as a mapping with two special features. 


Definition 4.1.1 Given two non-empty sets A and B, a function f is a rule, or 
set of instructions, by which each element of A is paired with exactly one element 
of B. Set A is called the domain and is denoted Dom f. Set Bis called the codomain. 
Notationally, if f is a function from A to B, we write f : A > B.Ifx € Ais paired 
by f with y € B, we write f(x) = y orxtb y. We say that y is the image of x, or 
that x maps to y, and x is a pre-image of y. The subset of B consisting of the images 
of all elements of A is called the range and is denoted Rng f. (See Figure 4.1 for 
a basic sketch.) 


Let’s sort out the details of Definition 4.1.1. First, for a mapping f: A— B 
to be a function, every x € A must have some image y € B. That is 


(F1) For every x € A, there exists y € B such that f(x) = y. 


In addition to property F1, the rule defining f must produce a unique f(x) for 
all x € A. We say that f is well defined if f(x) is unique for every x € A. How can 
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Figure 4.1 


Figure 4.2 
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f 


go 


Schematic of a function: f : A > B. 


The image of x is not unique. 


we say mathematically that the image of x € A must be unique? (See Figure 4.2 
for a sketch of what must not happen.) One way to say that the rule for f cannot 
produce multiple y-values for a single x is to say 


(F2) If y1, y2 € Bare such that f(x) = y1 and f(x) = y2, then yj = yp. 


At first glance, F2 might seem like a statement that would be true for any 
mapping. After all, if f(x) = y1 and f(x) = y2, then the transitive property of 
equality ought to allow us to say yi = y2. Unfortunately, this way of stating F2 
does not reveal potential problems in the way f is defined. Here are two examples 
that illustrate how a mapping can fail to be well defined. 


Example 4.1.2  Itis possible that the rule defining f : A > Bcan be ambiguous. 
For example, let f : R* — R be defined in the following way. For x > 0, define 
f(x) to be a solution y to the equation y* = x. You cannot doubt that this set 
of instructions generates a real number f(x) from any x in the domain, so that 
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property F1 is satisfied. However, the rule is ambiguous for every x in the domain. 
Letting x = 25, y, = 5 and y2 = —5, we have demonstrated the existence of y; and 
y2 in the codomain where f(x) = y; and f(x) = y2,but where y,} 4 yo. 


Example 4.1.2 illustrates a possible pitfall you might run into with the notation 
f(x) if you have not verified that f is well defined. Unless you know that f is well 
defined, you might find yourself using an expression like f(25) as one name for 
more than one distinct thing. We must make sure that the set of instructions for 
generating f(x) does not produce more than one value for any x in the domain. 

Here is another example of how a mapping can fail to be well defined, this time 
because a single element in the domain might have more than one distinct name. 


Example 4.1.3 Define f : Q— Z in the following way. For a rational num- 
ber p/q, define f(p/q) = p. That is, the image of a rational number written in 
a standard form of integer over integer is defined to be the numerator. Unlike 
Example 4.1.2, there is no ambiguity concerning what f(p/q) is. The problem here 
is that the standard definition of equality in the rationals (Section 3.8.1) lumps a 
lot of different expressions of the form p/q into the same equivalence class. As a 
specific example, though 2/5 = 6/15, f(2/5) 4 f(6/15). The fact that one domain 
element can be addressed by more than one name causes problems, because the 
image of an element of the domain depends on the form in which it is written. 
Thus f is not well defined because we have exhibited an x in the domain and y; 
and y2 in the codomain where f(x) = y1, f(x) = y2,buty1 4 v2. = 


Example 4.1.3 illustrates a good way to restate property F2. If x; and x2 are 
two names for the same thing, that is, if x; = x2, then the rule must produce the 
same functional value for x; and x2. That is, f(x,) = f(x2). 


(F2) If x1, x2 € A and xj = x2, then f(x) = f(x). 


Suppose someone gives us two sets A and B, and arule f for pairing elements 
of A with elements of B. If we are asked to verify that f is a function, then we 
must verify that F1-F2 hold. 


Example 4.1.4 Show that f : R— {1} — R defined by 
AP 


fe) = 


is a function. 
Solution 
(F1) Pick any x € R — {1}. Then by properties A3 and A9 (closure of addition 


and multiplication, respectively), x + 2 and x — 1 are both real numbers. 
Furthermore, since x # 1, then x — 1 is nonzero, so that (x — 1)~! exists as a 
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real number by property A13. Finally, (x? + 2) - (x — 1)~! € R by closure of 
multiplication. Thus f(x) exists in R. 


(F2) Pick x1, x2 € R — {1} and suppose x; = x2. Since addition and multiplication 
are well defined, it follows that iG; 4+2= iy +2 and x; —1= x2 —1. By the 
uniqueness of multiplicative inverses, (x, — ab ee 6 Again, since 
multiplication is well defined, ae +2)/, -1)= Ge + 2)/(x2 — 1), so that 
f(x1) = fx). of 


Example 4.1.4 illustrates a broad result that is pretty easy to see through the 
example alone. If f is a rule that can be written in the form of polynomial over 
polynomial, that is, if 


(2 es (4.1) 

Dmx" + by —1x"—* + +++ + b1x + bo 
where all the a; and b; are real numbers, then assuming we avoid values of x that 
make the denominator zero, f is a function from an appropriate subset of the 
real numbers into the real numbers. Here is another way to say it. Since addition 
and multiplication are well defined in the real numbers, and since additive and 
multiplicative inverses are unique, then any mapping whose rule is built up from 
the operations of +, —, x, + and whose domain avoids division by zero will be 
well defined. 

We can go even further. Thanks to Theorem 3.9.1 and Definition 3.9.3, we have 
ensured that ./x is well defined for all real x if n is odd, and for all nonnegative 
x if n is even. Therefore, we can say that any mapping whose rule is built up 
algebraically with +,—, x,+,0”, and %/ , and whose domain is a subset of the 
real numbers that avoids division by zero and even roots of negative numbers will 
be well defined. For example, 


1+ /x3 — 1/(x — 3) 


~—14+x75 


fix) =S4+x4+ ; (4.2) 


is a function whose domain is some hideous subset of the real numbers. Functions 
built up as is f in Eq. (4.2) are called algebraic. 

There is a simple function that will come in handy in Section 4.6. In its 
simplicity, it illustrates pretty well the sorts of things we have to show when 
verifying a mapping is a function and when showing a function has certain 
properties. 


Example 4.1.5 Let m and n be integers, where n is positive, and write N, = 
{1,2,...,n} and N”’ = {1+m,2+4+m,...,n+m}. Define a translation mapping 
T:N, — Ni? by T(x) = x + m. Show T is a function. 
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Solution We show that T satisfies properties F1—F2. 

(F1) Pick xe N,. Since 1 < x <n, it follows that 1+m<x+m<n-+m, or 
1+m < T(x) <n+m. Thus T(x) € NV’. 

(F2) Pick x1,x2 € N, and suppose x1 = x2. Then since addition is well defined, 
T(x1) = x1 +m = x2 +m = T(x2). Thus T is well defined. 


Notice that showing F1 in Example 4.1.5 involves not only showing that T(x) exists, 
but that it lies in the codomain. 
If a and b are real numbers such that a < b, then we use the following notation 
to represent certain subsets of the real numbers: 
[a, +oo) = {x €] 


(a, +00) = {x €] 


(—oo, b) = {x €] 
[a, b] = {x €] 


R 
R 
(co, b] = {x ER: x <5} 
R 
R 
(a,b) ={xeR 


EXERCISE 4.1.6 Show that each of the following are functions by showing they 
have properties F1—F2. 
(a) f : [0,1] — [0, 10] defined by f(x) =3x+4 
(b) f :R— [-2, 00) defined by f(x) =x? -1 
(c) f :R— R defined by f(x) = /1— x! 

If f(x) is a formula used to define a function whose domain and codomain are 
some subsets of the real numbers, then the natural domain of f is defined to be 
the largest subset of the real numbers for which f(x) exists as a real number. For 


example, if f(x) = x/(x + 2), then the rule for f can be used to generate a real 
number for all x 4 —2. Thus the natural domain of f is all real numbers except —2. 


EXERCISE 4.1.7 Determine the natural domain of each of the following 
functions. 


3x+5 
(a) fla) = 


(b) fix) = V¥F4 


1 See the comments after Definition 3.9.3. 
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1 
(c) 1) = ea 
(d) f@~) =V4- x 
-—1 
(e:) f= 5 
xe — x 


EXERCISE 4.1.8 Let A be any non-empty set, and define the identity mapping 
i: A— A by i(x) = x for all x € A. Show that the identity mapping is a function. 


Now we want to define a notion of equality for two functions. Certainly, we 
want the definition to be an equivalence relation. The standard definition of 
function equality is the following. 


Definition 4.1.9 Two functions f and g are said to be equal provided they have 
the same domain and codomain, and f(x) = g(x) for all x in the domain. 


To show that Definition 4.1.9 is an equivalence relation is not difficult. For 
example, to show E1, we note that f has the same domain and codomain as itself, 
and f(x) = f(x) for all x in the domain. Thus f = f. 


EXERCISE 4.1.10 Finish the proof that Definition 4.1.9 is an equivalence 
relation by showing it has properties E2-E3. 


At the beginning of this section, we said that mappings and relations are syn- 
onymous, even though they evoke different sorts of imagery. Some authors define 
a function as a special kind of relation, that is, a subset of A x B with certain 
properties. Instead of imagining elements of A being associated with elements 
of B via the input-output imagery of Definition 4.1.1, it is possible to define a 
function as a subset of A x B with two additional restrictions that coincide with 
properties F1—F2. 

For a relation RC A x B to be a function, property F1 says that, for all 
x €A, there must exist y € B such that (x, y) € R. Property F2 says that if 
(x, 1), (%, y2) € R, then yj = yp. 


EXERCISE 4.1.11 Let A= {1,5, 10} and B = {1, 2, 8, 9}. For each of the follow- 
ing relations, state whether it is a function. 

(a) 
(b) 
(c) 
(d) 


ds); 9)| 

(1,2), (10, 1), (5, 9)} 

(5, 1), (1, 1), (10, 2), (1, 8)} 
(5,2), (10, 2), (1, 2)} 


| 
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4.2 One-to-one and Onto Functions 


Property F1 says that a function f : A ~ B must map every element of A to some 
element of B. Property F2 says that this image must be unique, so that the path 
from x to f(x) must not have a fork in the road (Figure 4.2). Some functions 
have features that are analogous to Fl-F2 when viewed, as it were, in the reverse 
direction. Analogous to every element of A having an image in B, perhaps every 
element of B has a pre-image in A. A function with this special feature is said to be 
onto, so that Rng f = B. Also, analogous to the uniqueness of the image of every 
element of A, perhaps every element of Rng f has a unique pre-image. This type 
of function is called one-to-one. In a one-to-one function, the forbidden merging 
of the path from different elements of A to the same image in B is illustrated in 
Figure 4.3. In this section, we define these terms and look at features of functions 
that are one-to-one and/or onto. 


Definition 4.2.1 Suppose f : A > B is a function. Then f is said to be onto 
provided for every b € B, there exists a € A such that f(a) = b. That is, 


fisonto © (Vy € B)(ax € A)(y = fx) (4.3) 


; : t ings 3 
As a notational shorthand, we write f:A o. B. An onto function is also 


called a surjection. 


Contrast the definition of onto in (4.3) with property F1, which says 


(wx € A)(Gy € B)(y = f(x) 


Figure 4.3 A function that is not one-to-one. 
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Example 4.2.2 Show that f : R > R defined by f(x) = x° — 1 is a surjection. 


Solution You may recall from pre-calculus that the graph of a polynomial func- 
tion of odd degree extends all the way up and down the y-axis, so that any value 
chosen on the y-axis has a pre-image on the x-axis. But how do we prove f is 
onto? We have to choose an arbitrary y in the codomain and work backwards to 
find some x in the domain that maps to it. In other words, we must find a value of 
x that makes the equation y = f(x) true. Here is the proof. 

Pick any real number y, and let x = /y +1. By Theorem 3.9.1, x is a real 
number, and 


fx) = fWytD = l/y+ 1h -1l=y+1-l=y (4.4) 
Thus fisonto. 


EXERCISE 4.2.3 Show that the function T : N, > N”” from Example 4.1.5 is a 
surjection. 


EXERCISE 4.2.4 Let f : [0,2] — [1, 7] be defined by f(x) = 3x + 1. Show that 
f is a surjection. 


EXERCISE 4.2.5 Let A beanon-empt set,andi: A > A the identity function 
ply y 
(Exercise 4.1.8). Show that i is onto. 


EXERCISE 4.2.6 What does it mean to say that a function is not onto? 


EXERCISE 4.2.7 Show that f : [-2,3] — [0,10] defined by f(x) = x +1 is 
not onto. 


Definition 4.2.8 Suppose f : A > Bisa function. Then f is said to be one-to- 
one provided f(x) = f(x2) implies x1 = x» for all x1, x2 € A. That is, 


f is one-to-one = (Vx1, x2 € A)([f(a1) = f(x2)] > [x1 = x2]) (4.5) 


F ; 1-1 thads 
As a notational shorthand, we write f : A —> B. A one-to-one function is also 
called an injection. 


Note that the definition of one-to-one in (4.5) is the converse of property F2, 
which says 


(Wx1, x2 € A)([x1 = x2] > [fOr1) = f(x2))) 
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Example 4.2.9 Show that the function from Example 4.1.5 is an injection. 


Solution Pick x1,x2 € N, and suppose that T(x) = 7(x2). Then x1 +m= 
x2 + m,so that x1 = x2 by cancellation. Thus 7 is one-to-one. 


EXERCISE 4.2.10 The mapping f : R > R defined by f(x) = x° — 1 is a func- 
tion because it is algebraic and its natural domain is the set of all real numbers. 
Prove that f is one-to-one. 


EXERCISE 4.2.11 Let A be a non-empty set, and i: A— A the identity 
function. Show that 7 is one-to-one. 


With Exercises 4.1.8, 4.2.5, and 4.2.11, you have proved the following. 


Theorem 4.2.12 For a non-empty set A, the identity mapping is a one-to-one 
function from A onto itself. 


EXERCISE 4.2.13 What does it mean to say that a function is not one-to-one? 


EXERCISE 4.2.14 Show that f : [-2,3] — [0, 10] defined by f(x) = x* +1 is 
not one-to-one. 


F F 1-1 
If f : A > Bis both one-to-one and onto, we may write f : A —> B.A one- 
onto 


to-one, onto function is also called a bijection. A bijection from A to B is said to 
put the sets A and B into a one-to-one correspondence. 


EXERCISE 4.2.15 Here are several pairs of sets. 


(a) {1, 2,3, 4} and {a, b, c} 

(b) {1,2,3, 4} and N 

(c) ZandZ 

(d) Zand W 

(ec) RandR 

For each of the above pairs of sets, find four functions { f,, fo, 3, f4} from the first 
set to the second set with the following properties, if such functions are possible. 
It is not necessary to prove that your functions have the desired properties, but be 
prepared to provide as much explanation as necessary to support your claim. 

(i) f; is one-to-one but not onto. 


(ii) f2 is onto but not one-to-one. 
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(iii) £3 is both one-to-one and onto. 


(iv) f4 is neither one-to-one nor onto. 


EXERCISE 4.2.16 In this exercise, we want to consider functions f: S > R, 
where f is of the form 


ax +b 
cx +d 


fl) = (4.6) 


where c and d are not both zero and where S is the natural domain of f. Note that 
f is algebraic, so that it has properties F1-F2 on its natural domain. 


(a) First we determine S. 


(i) Suppose c = 0. By assumption, c and d are not both zero, so d 4 0. What 
is the natural domain of f? 


(ii) Suppose c 4 0. What is the natural domain of f? 


(b) What conditions on a, b,c, and d will guarantee that f is one-to-one? Deter- 
mine this condition, and then write a complete proof that f, with this 
additional hypothesis condition, is one-to-one.” 


c) From this point forward, assume that the condition from part (b) holds. Thus 
Pp p 
f is a one-to-one function from S to the real numbers. 


(i) Suppose c = 0. Show that f maps onto the real numbers. 


(ii) Suppose c 4 0. Determine the value of yo that has no pre-image in S, and 
prove that f maps S onto R — {yo}. 


Image and Pre-Image Sets 


Suppose f : A > Bisa function. Given A; C A, we sometimes want to talk about 
the subset of B that consists of the (unique) images of all the elements of A1. Also, 
given By C B, we sometimes talk about the subset of A that consists of all the pre- 
images of all the elements of B,. In this section, we define these set constructions 
and investigate some of their properties. 


Definition 4.3.1 Suppose f : A > Bisa function and A; C A. Then the image 
of A;, denoted f(A}), is defined by 


f(A) ={ye B: xe Ar) = f@))} (4.7) 
Thus y € f(Aj) if and only if there exists x € A, such that y = f(x). (See Fig. 4.4.) 


If the statement f(x) = f(x2) > x1 = x2 is going to be a true statement, what must be assumed about 
a, b,c, and d to make it so? Play with the statement f(x1) = f(x2). 
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Figure 4.4 The image of A, under f from Definition 4.3.1. 


With the notation of Definition 4.3.1, we can define the range of a function 
f: A> Bby 


Rng f = f(A) ={ye B: Axe A\(y= f@))} (4.8) 


EXERCISE 4.3.2 Define f:R—R and g:R>R by f(x) =x and g(x) = 
x> — x. Let Ay = {—2, —1,0, 1, 2}. Construct f(A1) and g(A}). 


EXERCISE 4.3.3 Suppose f : A > Bisa function, and let A; and A2 be subsets 
of A. In the following two set equality statements, three of the four subset inclusion 
statements are true, and one is false. Prove the three that are true, and provide a 
counterexample to demonstrate that the fourth is false. 


(a) f(A, Az) = f(A1) 9 f(A2) 
(b) f(A1 U Az) = f(A1) U f(A2) 


EXERCISE 4.3.4 Suppose f : A > B is a one-to-one function. Show that the 
false subset inclusion statement from Exercise 4.3.3 is true with this additional 
assumption. 


EXERCISE 4.3.5 Let f :A— Bbea function and let F = { Ag }uc,4 be a family 
of subsets of A. For the three subset inclusion statements in Exercise 4.3.3 that 
are true, state and prove analogous theorems for F. 


EXERCISE 4.3.6 Suppose f: A — B is a function and A; C Az CA. Then 
f(A1) © flA2). 
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EXERCISE 4.3.7 Suppose f : A > Bisa function and Aj C A. 


(a) The statement fA =[f(Ay)]© is not true in general. In fact, neither 
direction of the subset inclusion is true. Demonstrate these facts by con- 
structing a counterexample to each of the statements fA) C [f(A1)]© and 


FAQ) 2 [fADIE. 


(b) Determine an additional hypothesis condition on f that will guarantee 
f(A) C [f(ADIS, and prove your claim. 


(c) Determine an additional hypothesis condition on f that will guarantee 
fA > [f(A1)]©, and prove your claim. 


Now we turn to the pre-image set of a subset of the codomain. 


Definition 4.3.8 Given a function f : A — B, and B, C B, we define the set 
f hv: the pre-image of B, by 
f-\(By) = {xe A: fl) € Bi} (4.9) 


Thus x € f—!(B}) if and only if f(x) € Bi. If Bi = {y} has only one element, we 
generally write f—!(y) instead of f~!({y}). (See Fig. 4.5.) 


EXERCISE 4.3.9 Define f:R—R and g:R—>R by f(x) =x’ and g(x) = 
x — x. Let By = {—9, 0, 4}. Determine f~!(B,) and g—!(0). 


EXERCISE 4.3.10 Suppose f: A — B is a function and B, C Bo C B. Then 
f-"(Bi) & f-\(Ba). 


EXERCISE 4.3.11 Let f:A— B be a function, and suppose B, and B2 are 
subsets of B. Prove the following set equality statements, if possible. If you claim 


f 


a 


Figure 4.5 The pre-image of By under f from Definition 4.3.8. 
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that any of the subset inclusion statements are false, provide a counterexample 
to verify your claim. 

(a) f7'(BiO By) = f-'(B1) 9 f-"(B) 

(b) f~'(B1 U Ba) = f-'(Bi) U f-"(Ba) 

(c) f-\(BP) = LBD Ie 


EXERCISE 4.3.12 Define f: R > R by f(x) = x2. Let Ay = {—1, 2} and B; = 
{—9, 0, 4}. Determine f—![f(A1)] and f[f~!(By)]. 


EXERCISE 4.3.13 Suppose f : A > Bisa function, Ai C A and B C B. 


(a) Prove the following. 


(i) Arc f'L f(A] 
(ii) FIP (BDIS By 

(b) Exercise 4.3.12 shows that the reverse subset directions from part (a) do not 
hold. However, each of these can be proved in the reverse direction with one 
additional hypothesis condition. Determine the needed additional condition 


on f for each of the subset statements that will make the reverse subset 
inclusion statement true. Prove your claims. 


EXERCISE 4.3.14 Let f: A — Bbea function, A; C A and B, CB. 


(a) If x € Aj, what can you say about f(x)? 

(b) If y € f(A1), what does this mean? 

(c) Ifxe Aand f(x) € f(A1), then is it necessarily true that x € A1? 
(d) Ifx € f—'[f(A1)], then is it necessarily true that x € A1? 

(e) If.x € Aj, then is it necessarily true that x € f~![f(A1)]? 

(f) Ifx € f—'(B,), then what can you say about f(x)? 

(g) If f(x) € By, then what can you say about x? 

(h) If y € By, then is it necessarily true that y € f[,f~!(B,)]? 

(i) If y e f[ f~'(By)], then is it necessarily true that y € B,? 


Composition and Inverse Functions 


A function is a sort of linking of a set A to a set B with certain restrictions. If A 
is linked to B by a function f, and B is linked to C by a function g, then we can 
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link A to C via B, using f and g in succession. Composition is the term we use 
when combining functions in this way. In this section we will define composition 
of functions and then investigate the possibility of using f : A > B to link B back 
to A by reversing the function f. Such a reverse function, if one exists, is called an 
inverse of f. 


Composition of Functions 


Suppose f : A > Bandg: B > Care functions. We want to define a new mapping 
go f:A-— C that uses the rules of f and g together as in Figure 4.6. 


Definition 4.4.1 Givenfunctions f : A > Bandg: B > C,we define the com- 
position go f : A— C to be the mapping defined by (g 0 f)(x) = g(f(x)) for all 
xeA. 


Example 4.4.2 Consider the functions f : R— R* defined by f(x) =x7 +1 
and g: Rt — R defined by g(y) = //y. Then (go f)(x) =Vx7+1. 


Notice we did not presume to say that go f is a function in Definition 4.4.1. 
We must use the fact that f and g are themselves functions to show that properties 
F1-F2 hold for go f. 


EXERCISE 4.4.3 Given functions f:A— B and g:B—-C, the mapping 
go f:A-— C from Definition 4.4.1 is a function. 


We want to address several questions that involve relationships between f, g, 
and go f based on their individual characteristics. For example, if f and g are 
one-to-one, does it follow that g o f is one-to-one? What about onto? If you know 
something about f and g o f,can you conclude anything about g? In the following 
exercise, we compose several statements of this sort. We call them questions 
(Q1-Q6) because you must determine whether they are true or false. Not only 
are some of them true, but some are true even in the absence of one of the given 


Figure 4.6 Composition: (go f): A> C. 
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hypothesis conditions. As an example, we prove that the first of the statements 
is true. 


EXERCISE 4.4.4 Let f:A— Band g: B= C be given functions. Prove each 
of the following statements, if possible, or provide a counterexample to demon- 
strate that the statement is false. If a proof does not require one of the provided 
hypothesis conditions, note how the hypothesis may be relaxed and still be a true 
statement. 


(Q1) If f and g are one-to-one, then go f is one-to-one. 


Proof. Suppose f and g are one-to-one functions, and suppose (go f) 
(x1) = (g o f)(x2). Thus g[ f(x1)] = gL f(x2)]. Since g is one-to-one, we have 
f(x1) = f(x2). Similarly, since f is one-to-one, it follows that x1 = x2. Thus 
go f is one-to-one. 


(Q2) If f and go f are one-to-one, then g is one-to-one. 
(Q3) If g and go f are one-to-one, then f is one-to-one. 
(Q4) If f and g are onto, then go f is onto. 
(Q5) If f and go f are onto, then g is onto. 
(Q6) If g and go f are onto, then f is onto. 


EXERCISE 4.4.5 Of the Q1-Q6 that are false, one hypothesis condition can 
be replaced with a different condition to arrive at a true statement. Determine 
the needed hypothesis condition for each of these, and prove that the resulting 
statement is true. 


Inverse Functions 


If f : A— Bisa function, then the rule linking A to B is well defined (F2) on all 
of A (F1). We now ask a question from the perspective of B. Given f : A > B, 
can we find a function g: B > A whose rule is the exact reversal of the rule for 
f? For example, if f(8) = —2, then we want g(—2) = 8. Some functions have an 
inverse, and some do not. First, we establish the criteria by which a function g 
qualifies as an inverse of f; then we present a theorem that states when such an 
inverse exists, and that it is unique. 


Definition 4.4.6 Suppose f : A > Bisa function. Then we say that a function 
g: B-— Aisan inverse of f if (go f)(x) = x for all x € A and (f 0 g)(y) = y for 
all y € B. Such a function g is generally denoted f—!. 


We must make a careful distinction at this point. In Section 4.3, we used the 
notation f—!(B;) to represent the pre-image set of a subset of the codomain of f. 
This notation has nothing to do with an inverse function f—! per se, and the 
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notation f—!(B,) can be used for any function f, even if no inverse function for f 
exists. Furthermore, in Section 4.3 we used the notation f—!(y) to represent the 
pre-image set of the single element set {y}, which quite honestly is an abuse of 
notation. To write f—!(8) appears to be the evaluation of a particular function at 
a particular point, but it is not necessarily so. If you do not already know that f 
has an inverse function, then f—!(8) is the subset of the domain of f consisting of 
all elements that map to 8 under f. However, if a function f—! does exist, then 
f (8) is indeed a unique value. 

Here is the theorem that states precisely when a function has an inverse and 
that such an inverse is unique. 


Theorem 4.4.7 Suppose f : A > B is a one-to-one function from A onto B. 


Then there exists a unique inverse function f—! : B —> A. 


The proof of Theorem 4.4.7 has several parts to it. You will supply all the details 
in the next exercise, with the help of an outline. Our approach will be to define 
a new function g: B > A by appealing to f, then show that g is a one-to-one 
function from B onto A. Next, by showing that g has the properties required by 
Definition 4.4.6, g will have earned the right to be called an inverse of f. Finally, 
you will show that the inverse is unique. 

To define the rule for g: B > A, we must pick y € B and explain how to find 
some x € A that we will call g(y). We do this in the following way. Given any y € B, 
we define g(y) to be any solution x to the equation f(x) = y. 


EXERCISE 4.4.8 Complete the following proof of Theorem 4.4.7 by supplying 
all the missing details. 


Proof of Theorem 4.4.7. Let f : A = B be a given function. Define a mapping 
g: B > Ain the following way. For a given y € B, let g(y) be any solution x to the 
equation f(x) = y. 

(a) Show that g is defined on all of B(F1). 

(b) Show that g is well defined (F2). 

(c) Show that g is one-to-one. 

(d) Show that g is onto. 

(e) Show that (go f)(x) = x for all x € A. 

(f) Show that (f o g)(y) = y forall ye B. 


(g) Finally, suppose g; and g2 are two functions from B to A that satisfy 


(g1 0 f)(x) = (82 0 f)(x) = x for all x € A and (fo g1)(y) = (fo ga\(y =y 
for all y € B. Show that g1 = g2 by showing gi(y) = g2(y) forall ye BS 


3 All you need are that (f o g1)(y) = (f 0 g2)(y) = y for all y € B and that f is one-to-one. 
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The next exercise brings together many of the results you have derived to this 
point. Its proof should require little more than an appeal to several results you 
have already proved. 


EXERCISE 4.4.9 For non-empty sets A and B, define A = B provided there 
exists a one-to-one function from A onto B, that is, there exists a bijection from A 
to B. Then = defines an equivalence relation on the family of all non-empty sets. 


Three Helpful Theorems 


In Sections 4.6 and 4.7 we will discuss set cardinality, which is a fancy expression 
for the number of elements in a set. In Section 4.6 we define what it means for a 
set to be finite, and in Section 4.7 we look at infinite sets. In preparation for the 
definition of finite set and the rigor we require in proving theorems about finite 
sets, three preliminary theorems will be helpful. 

These theorems are not the prettiest ones you will ever see in your mathemat- 
ical life, but they are not the ugliest either. They are not complicated in principle, 
but proving them requires slavish attention to some rather minute details. If noth- 
ing else, they illustrate the inescapable fact that laying the groundwork for later, 
more elegant results sometimes requires you to muddle your way through prelim- 
inary theorems whose truth is transparent, but whose proofs are a bit sloppy and 
involve multiple cases. You will prove the first of these theorems. The proof of the 
second will be provided here, because the sneaky technique it employs will come 
in handy as you prove the third. 

Our first result says that if you are given two one-to-one, onto functions defined 
from disjoint domains to disjoint codomains, then it is possible to paste together* 
the domains, paste together the codomains, and then define a single function from 
the pasted domains to the pasted codomains that is also one-to-one and onto. 


EXERCISE 4.5.1 Suppose f; : A; > B, and fz: Az > Bo are bijections, and 
that Ay 1 A2 = By By =. Define f : Ay U Az > By U Bp by 


= |A@), Wee Ay 
ces me ee 


Then f is a bijection from A, U Az to By U Bo. 


In Section 4.6 we will talk about sets that contain n elements. We place this on 
firm mathematical footing by using the set N, = {1,2,...,} as the domain of a 
one-to-one function from N,, onto the given set. The creation of such a function 
formalizes the way you would count the elements of the set one ata time 1, 2,...,7 
and then declare that the set has n elements. It also focuses the entire discussion 
of all finite sets onto the family of sets of the form N,, for all positive integers n. 


4 That is, form the disjoint union. 
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The next helpful theorem will allow us to prove in Section 4.6 that any subset 
of a finite set is finite. That is, if B has n elements and A is a subset of B, then 
there is some m <n such that A has m elements. We prove this theorem here, 
for it is a little sticky and will prove to be a good warm-up for proving the 
third. 


Theorem 4.5.2 Let n be a positive integer, and let S CN, be any non-empty 
set. Then there exists a positive integer m <n and some f : S > N,, where f isa 
one-to-one, onto function. 


Proof. We prove by induction on n > 1. 


(11) Ifn =1, then S=N, = {1}. Letting m = 1 we may define f : S > N,, by 
f(1) = 1, which is clearly a one-to-one, onto function. 


(12) Suppose n > 1 and that the result is true for any non-empty subset of N,. 
Let S be any non-empty subset of N,+1, and consider the following three 
cases. 


If S = Nya, then we may let m=n+1 and f: S > N,, be the identity 
function. 


If SC Nyy andn+1¢ 5S, then § CN,. Thus the inductive assumption 
applies, and there exists m <n anda function f : S$ = Nin- 
onto 


If SC Nyy and n+1€S, then there exists k e¢ N, such that k ¢ S. Let 
T = [SU {k}] — {n + 1}. (Draw a picture!) Then T C N,, and the inductive 


; : ; : 1-1 
assumption applies to yield m <n and a function f : T —> N,,. Define 
onto 


g:S—> Nm by 


_ fe, ifxeT—{h 


8) = Vw, ifxant 


(4.10) 


Defining A, = T — {k}, Az = {n+ 1}, BL = Nn — (f(}, and Bp = {fk}, 
we may apply Theorem 4.5.1 to conclude that g: S > N,, is one-to-one 
and onto. 


The last theorem of this section claims that a certain type of one-to-one, onto 
function does not exist. It will allow us to prove in Section 4.6 that the number of 
elements in a finite set is well defined. Thus if Nelson says a set has n elements, 
while Minnie says it has m elements, then the only way they can both be right 
is if m =n. A suggestion for the proof of Exercise 4.5.3 is to use induction on 
m>1. 
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EXERCISE 4.5.3 Suppose m and n are positive integers with m < n. Then there 
is no bijection f : N, > Nino’? 


Finite Sets 


What does it mean to say that a set has n elements? Or given two sets, what does it 
mean to say they have the same number of elements? The term that denotes the 
number of elements in a set is cardinality. Consider the following sets. 


A={-5,-2,1,4,7,10} and B= {b,d,h,p,f,]} (4.11) 


What would you say is the cardinality of A? Would you be inclined to say that A and 
B have the same cardinality? If the cardinality of a set is finite, whatever that might 
mean, then saying it has cardinality n ought to be a pretty straightforward term 
to define. Also, if two sets are finite, then saying they have the same cardinality 
ought to suggest a pretty natural relationship between them. 

To determine the cardinality of A defined in Eq. (4.11), you probably counted 
six elements, then used that as a basis for saying A has cardinality six. Doing 
the same for B, you then could say that A and B have the same cardinality. 
To count elements in this way is precisely the motivation behind the following 
definition. 


Definition 4.6.1 Let A beaset, and suppose there exists a positive integer n and 
a bijection f : N, — A. Then we say that A is a finite set and that it has cardinality 
n, which we write |A| = n. For the empty set, we make a special definition. Let No 
be defined as the empty set, and define the empty mapping f : No > @ in order to 
say that the empty set is finite and has cardinality zero. 


To define f : No > @ as an empty mapping does not really jibe with Def- 
inition 4.1.1, for in our definition of function, we required that domains and 
codomains be non-empty. But there is no reason we cannot extend the definition 
of function to include this case, as long as we make sure that it has properties 
F1-F2. And it does. In fact, the empty mapping meets all the requirements 
for being a bijection from the empty set to itself, because all the requirements 
involve the universal quantifier. For example, the empty mapping is onto, for 
if it were not, then there would exist some y € 9 which has no pre-image. But 
no such y exists, so the empty mapping is onto. This illustrates an odd truth in 


Prove the inductive step by contrapositive. 

If f : Ny41 > Nm41 is one-to-one and onto, and if f(n+1)=m-+1, then defining g should be 
easy. Otherwise, f(n + 1) =k for some 1 <k <m and f(t) =m-+1 for some 1 < £ <n. Define g 
to map £ tok. 
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mathematics: Statements of the form (Vx € 4)(P(*)) are true, regardless of P(x). 
For example, every negative natural number is a proper divisor of 11. 


Example 4.6.2 Show that the set B in (4.11) has cardinality 6. 


Solution Define f : Ne > Bin the following way. Let f(1) = b, f(2) =d, f(3) = 
h, f(4) = p, f(5) =f, f(6) =1. Since f is one-to-one and onto B, we have shown 
|B|=6. Hf 


Example 4.6.3 By the identity mapping, |N,|=7n. 


Definition 4.6.1 introduces two terms concerning the size of some but not all 
sets: finiteness and cardinality. To say a set A is finite is to mean that it has some 
nonnegative integer n associated with it, called its cardinality, which derives from 
the existence of some bijection from N, to A. Conversely, to say that |A| =n, 
where n is a nonnegative integer, is to say that A is finite. Notice that the empty 
set is the only set with cardinality zero. For if A is any non-empty set, then no 
function from No to A can be onto. 

With Example 4.6.2, we have earned the right to make a statement such as “B 
has six elements,” and we have validated our inclination to determine its cardinality 
by walking our fingers over its elements and counting them. We must be careful, 
though. The existence of a bijection from N, to A is a basis for declaring the 
cardinality of A to be n. But how do we know that there is not some different m 
and a bijection from N,, to A? If there were such an m, then cardinality would 
not be well defined, for |A| could be two different numbers. With the help of 
Exercise 4.5.3, your proof of the next result should be quick. 


EXERCISE 4.6.4 The cardinality of a finite set A is well defined. That is, if 
y 
|A| =m and |A| =n, then m =n.’ 


Exercise 4.4.9 allows us to say that the family of all sets is partitioned into 
equivalence classes based on the existence of bijections between them. Now we 
can see what some of these equivalence classes are. Definition 4.6.1 says that 
the cardinality of some sets is a nonnegative integer n, provided the set is in 
the equivalence class of N,,. Furthermore, by Exercise 4.6.4, that n is unique. So 
if m #n, then N,, and N, are representative elements of different equivalence 
classes. With this, we have proved the following. 


Theorem 4.6.5 Suppose A is a finite set, and B is any set. Then A and B have 
the same cardinality if and only if there exists a bijection from A to B. 


Corollary 4.6.6 If f : A— Bisa one-to-one function, then |A| = |Rng Fis 


7 Address the empty set as a special case. Otherwise, if the result is not true, it violates Exercise 4.5.3. 
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The next exercise says that if A and B are disjoint sets, and if there exist 
bijections f,:N,— A and fo: N, — B, then you can construct a bijection 
f :Nntn > AUB. Once you have defined such a function, showing it is a bijec- 
tion from Nini, to AU B should amount to little more than calling on previous 
exercises. 


EXERCISE 4.6.7 If A and B are disjoint finite sets with cardinalities m and n, 
respectively, then |A U B| = m +n8 


EXERCISE 4.6.8 If |B] =n and A C B, then A is finite and satisfies |A| < n.° 


Corollary 4.6.9 Suppose |A| = 7 and C is any set. Then |AN C| < |A|. 


Proof. Since ANC C A, the result is immediate from Exercise 4.6.8. 


EXERCISE 4.6.10 Suppose U is a finite universal set, and A C U. Then A and 
A© are both finite, and |A| + |AC| = |UI. 


EXERCISE 4.6.11 Suppose |A| = |B| =n, and suppose f : A > B is any one- 
to-one function. Then f is onto.!° 


EXERCISE 4.6.12 If|A| =m and|B| =n,then|AU B| < m+n! 


EXERCISE 4.6.13. The union of a finite number of finite sets is finite. ! 


Infinite Sets 


The following definition should not be too surprising. 


Definition 4.7.1 A set is said to be infinite provided it is not finite. 


By negating Definition 4.6.1, we see that a set A is infinite provided that for 
every nonnegative integer n, every function from N,, to A fails to be either one- 
to-one or onto. In reality, if there were some f : N, — A that is onto but not 
one-to-one, we could create a function f; : N,, > A forsome m < n that culls out 


8 Make use of Exercise 4.5.1 where the domain of f is Nm+n and the range is A U B. The function 
T : Nn > N?’ from Example 4.1.5 should be helpful, too. 


9 If f : Ny > Bisa bijection, apply Theorem 4.5.2 to f~!(A). Construct g : Nin aa A by composition. 


10 What if f is not onto? 
1 See Exercise 3.2.14. 
12 Use induction and Exercise 4.6.12. 
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the repetition in f and is therefore one-to-one.!? By creating f; in this way, we 
would have that A is finite. The point is stated in the following. 


Theorem 4.7.2 A set A is infinite if and only if for every nonnegative integer n 
and every function f : N, — A, f is not onto. 


Loosely speaking, if a set is infinite, then no matter how large n is, there are not 
enough elements in N,, to tag all the elements of the set. 

Before we get into the interesting results of infinite sets, let’s point out where 
it will lead. Strangely, just because two sets are both infinite, it does not follow that 
they have the same cardinality, in the sense that there is a one-to-one correspon- 
dence between them. Some infinite sets are actually bigger than others. This makes 
for some real surprises and motivates us to discuss different orders of infinity. In 
fact, it is possible to generate an infinite sequence of infinite sets A;, Az, ... where 
|A;| is a higher order of infinity than |A,_;|. It’s mind boggling. Not only is there 
more than one size of infinity, but also there are infinitely many infinities. A nat- 
ural question then arises: Of the infinitely many different orders of infinity, which 
one represents the number of distinct infinities that exist? We will look at only 
two sizes of infinity. Here is our first. 


Definition 4.7.3 Suppose A is an infinite set. Then A is said to be countably 
infinite provided there exists a one-to-one function from the positive integers N 
onto A, and we say that A has cardinality Xo (8 is the Hebrew letter aleph). If A 
is finite or countably infinite, we say that A is countable. If A is not countable, we 
say that it is uncountable. 


Example 4.7.4 By the identity function, the natural numbers are countably 
infinite. li 


EXERCISE 4.7.5 Show that the whole numbers are countably infinite. 


The equivalence relation from Exercise 4.4.9 allows us to say that the natural 
numbers are a representative set from the equivalence class of all countably infinite 
sets. Let’s see some other sets that are countably infinite and prove some theorems 
about countable sets in general. Remember, to show a set A is countably infinite, 
your task is to construct a one-to-one function from the positive integers onto A. 
Sometimes an easy way to build such a function is by systematically declaring the 
values of f(1), f(2), f(3), and so on, which is the same as ordering the elements 


We begin by creating a subset § C Ny, so that f : S > A is one-to-one by applying the axiom of choice 
(discussed on page 118). For each a € A, we could take a single element of f—!(a), and then let 
S consist of all these chosen pre-images. Then we could apply Theorem 4.5.2 to conclude the exis- 
tence of a bijection g : S > N» for some m <n. Then f; = fo gl would be a one-to-one function 
from Nj» to A. 
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of A. This ordering will correspond to a one-to-one, onto function provided every 
element of A appears in the ordering exactly once. Once you have done this, you 
might be able to concoct an explicit formula for f(). In the next exercise, you 
might find a way to define the function by 


ifn is even 
? ifnis odd 


fn) = 


then apply Exercise 4.5.1. 
EXERCISE 4.7.6 The integers are countably infinite. 


If we expect to stumble eventually onto an uncountable set, the rationals might 
seem to be the first one we would find. After all, in Exercise 2.2.9, you showed 
that a < (a+ b)/2 < b,so that between any two rational numbers there is another 
rational number. Thus the rational numbers are strewn densely up and down the 
real number line, as opposed to the integers, which have plenty of room in between 
each one. Well, the rationals are countable. In Theorem 4.7.7 we prove that the 
positive rational numbers are countable. Although it might sound counterintuitive, 
it is possible to list the positive rationals sequentially as f(1), f(2), f(3), and so on, 
to create a function that demonstrates countability. 


Theorem 4.7.7 The positive rationals are countable. 


Proof. Consider Figure 4.7, which lists all the positive rational numbers. From this 
figure, we may define f in the following way. Starting in the upper left corner of 


Figure 4.7 Ordering of Q* to show countability. 
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the table, define f(1) = 1/1. We then move down successive diagonals from upper 
right to lower left, defining f(2) = 1/2 and f(3) = 2/1 from the first diagonal. 
Moving down the next diagonal, we define f(4) = 1/3. But since 2/2 = 1/1, we 
skip 2/2 and define f(5) = 3/1. Continuing in this fashion, 


1 2 3 4 1 5 


and so on, making sure f is one-to-one by skipping over rational numbers that 
are not in reduced form. This program guarantees that f is one-to-one and onto, 
so we have shown that the positive rationals are countable. 


Now let’s pause for a moment and reflect on the proof of Theorem 4.7.7. We 
can think of constructing the one-to-one correspondence as sequencing the ele- 
ments of A as (a1, a2, a3, ...), where every element of A is in the list exactly once. 
If such a listing of elements of A can be found, then you have shown that A is 
countably infinite. 

In tracing through the grid in the proof of Theorem 4.7.7, we ensured that f is 
one-to-one by skipping over entries that were not in reduced form. If we had not 
done this skipping, the resulting f would still have been onto, but not one-to-one. 
Knowing a priori that the positive rationals are infinite, this failure of f to be 
one-to-one would not be particularly disconcerting. Loosely speaking, here’s why. 
If there are enough positive integers to tag all the positive rational numbers with 
some repetition, then there ought to be enough to tag them without repetition. 
The next theorem gives us the freedom not to worry about this repetition and will 
save us from some minor headaches later. It deals with all countable sets, whether 
they are finite or countably infinite. We state it here without proof. 


Theorem 4.7.8 A non-empty set A is countable if and only if there exists a 
function from the positive integers onto A. 


The convenience of Theorem 4.7.8 will become apparent in the following 
results, where we need not worry if a given countable set is finite or count- 
ably infinite. For example, in Exercise 4.7.9, given two countable sets A and B, 
Theorem 4.7.8 will allow you to prove that A U B is countable by constructing a 
mapping from the positive integers onto A U B without having to worry whether A 
and B are disjoint or whether they are finite or countably infinite. Since the union 
across a family of sets is unchanged if all empty sets in the family are omitted, we 
are free to assume that all sets in the following exercises are non-empty. 


EXERCISE 4.7.9 If A and Bare countable, then A U B is countable. 


EXERCISE 4.7.10 If {Ax}7_, 18 a finite family of countable sets, then U?_, Ax is 
countable. 
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EXERCISE 4.7.11 If {An }f<, is a countably infinite family of countable sets, 
then U%’ , Ay is countable. !4 


By Exercise 4.7.10, it follows that the rationals are countable, for Q= 
Q*t UQ@ U {0}, and Q™ is countable in the same way that Q* is. 

If every function f : N > A fails to be onto, then A is uncountable. This is how 
we prove the following. 


Theorem 4.7.12 [0,1] is uncountable. 


Proof. Suppose f : N — [0,1] is any function. We can show that f fails to be 
onto. From assumption A21 in Chapter 0, we may represent [0, 1] as the set 
of all decimal expansions of the form 0.XXXXX.... Consider the following 
diagram. 


fQ) =0.a,ana3aq... 
f(2) = 0. by by b3 by... 
FO) = 0) Cher ey eae, 
f(4) = 0. dy dy d3 dy... 


FS) = 0. e1 e2 63 eq... 


We can show f is not onto by constructing a real number in [0, 1] that is not in the 
list. Let x = 0.x1x2x3x4... where x1 4 a1, x2 4 b2, x3 # c3, etc.; and no x, = 9,15 
Then x € [0, 1], and x is different from every number in the range of f. Thus [0, 1] 
is not countable. 


EXERCISE 4.7.13 If Bis countable and A C B,then A is countable.!® 
Corollary 4.7.14 The real numbers are uncountable. 


Proof. Since [0,1] is uncountable and [0,1] C R, the result follows from 
Exercise 4.7.13. 


EXERCISE 4.7.15 The irrationals are uncountable. 


14 Induction won’t help here. Try a grid like that in the proof of Theorem 4.7.7. 
15 This way we guarantee that x does not settle into the pattern of the repeating 9. 
16 Use Theorem 4.7.8 to take care of the case where is non-empty. 


144 


4.8 


4.8.1 


Chapter 4 Functions 
Cartesian Products and Cardinality 


Combinatorics is the area of mathematics that deals with techniques for counting. 
In this section and the next, we study some useful counting techniques whose 
mathematical foundation lies in the cardinality of Cartesian products and the 
number of functions from one finite set to another. 

To simplify some of the language and notation, if A is a finite set and f is a one- 
to-one function from N, onto A, we will address elements of A as {a1, a2, ..., An}, 
where a; = f(k) for 1 <k <n. Since f is a function, every k can be used to refer- 
ence a unique a; € A. Since f is one-to-one and onto, every element of A can be 
addressed as a; for some unique k. 


Cartesian Products 


In Section 3.11, we defined the Cartesian product of two sets A and B as 
Ax B= {(a,b):aE€A,be B} (4.12) 


the set of ordered pairs whose first coordinate comes from A and whose second 
coordinate comes from B. Assuming A and B each have a form of equality defined 
on them, which we temporarily write as =4 and =,, we can define equality in 
A x Bby 


(a1, b1) =Axp (a2,b2) ifandonlyif a, =,4 a2 and bi =, b2 (4.13) 


EXERCISE 4.8.1 Assuming =, and =, are equivalence relations on A and B, 
respectively, show that =4, 2 defines an equivalence relation on A x B. 


From this point forward, we will simply write that (a1, b1) = (a2, b2) provided 
a = a2 and b, = bp. Our first theorem will serve you as the root of an induction 
argument for Exercise 4.8.5. 


Theorem 4.8.2 Suppose |A| = 1 and |B| =n > 1.Then|B| = |A x BI. 


We prove Theorem 4.8.2 by constructing a bijection from B to A x B, thereby 
demonstrating that B and A x B are in the same cardinality class. 


Proof. Write A = {a} and B = {b1, b2,...by}. Define f: B—> Ax B by f(by) = 
(a, by). First, for any by € B, (a, bg) € A x B,so f has property F1. Also, if bj = by, 
then f(b;) = (a, bj) = (a, by) = f(bx), so that f has property F2. If we suppose 
f(bj)) = fbx), then (a, bj) = (a, bg), which implies b; = by, and f is one-to-one. 
Finally, if we pick any (a, by) € A x B, then by € Band f(bx) = (a, by), so that f 
is onto. Since we have found a one-to-one correspondence between B and A x B, 
they are in the same cardinality class, and |B| = |A x By. 
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EXERCISE 4.8.3 If Aj, Az, and B are sets, then 
(Ay U A2) x B= (Aq x B)U (Az x B) (4.14) 
Furthermore, if Ai and A2 are disjoint, then so are (A; x B) and (A2 x B). 


Corollary 4.8.4 If A;, Az, and B are finite sets, where A; and A? are disjoint, 
then |(Aj U Az) x B] = |Ay x B] + Ao x BI. 


Proof. By Exercises 4.6.7 and 4.8.3, 


|(Ay UA) x B| = |(A1 x B)U(A2 x B)| = |A1 x B] + Az x BI (4.15) 


You now have all you need to write an induction argument for the following. 


EXERCISE 4.8.5 Suppose A and B are non-empty, finite sets. Then 
|A x B| = |A| x |B|!” (4.16) 


We can form the Cartesian product of more than two sets, but to do it conve- 
niently we need to be a bit less than rigorous. We want A x B x C to be the set of 
ordered triples (a, b,c) wherea € A,b € B,andc € C. But to define A x B x Cin 
terms of the binary Cartesian product defined in Eq. (4.12), we need to associate 
either A and B, or B and C. This leaves us with (A x B) x C or A x (Bx CQ), 
which, unfortunately, are different. As we defined the Cartesian product 
in Eq. (4.12), 


(A x B)x C= {((a,b),c): aE A, be B,c EC} (4.17) 
but 
Ax (Bx C) ={(a, (b,0)) :a€ A, bE Bc eC} (4.18) 


Expressions of the form ((a, b), c) and (a, (b, c)) are not the same, though it does 
not create an insurmountable obstacle. Rather than try to deal with the lack 
of associativity in the Cartesian product of sets, we make the following recur- 
sive definition and then give ourselves the freedom to ignore the associativity 
question. 


17 Induct on |A| > 1, thinking of | B| as fixed. 
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Definition 4.8.6 Suppose {A,} is a family of sets indexed by the positive 
integers. Then we define the sequence of Cartesian products recursively as 


1 
[ [4<=41 (4.19) 
oa 
n+1 n 
[ [ 4« = ([] 40) x Angi forn > 1 (4.20) 
ke]. k=1 


With Definition 4.8.6, elements of [];_, {Ax} for the first few values of n would 
look like this: 


+ 


Ag = {a:ae Aj} 


> 
Il 
e 


Ag = {(a, a2) : ax € Ax for 1 < k < 2} 


amis 


> 
Il 
aan 


(4.21) 


em 


Ax = {((a1, 42), 43) : ax € Ax for 1 < k < 3} 


> 
Il 
e 


aml 


Ax = {(((a1, 42), 43), a4) : ax € Ag forl <k < 4} 


> 
Il 
es 


Instead of writing elements of [];_, Ax this way, we give ourselves the freedom 
to address elements as (a1, a2,..., Gy), n-tuples where ax € Ax for all 1 <k <n. 
With this definition and another induction argument, you can prove the following. 


EXERCISE 4.8.7 Suppose {A;}/_, is a finite family of finite sets. Then 


n 
[] A 
k=1 


és I] |Ax| (4.22) 
k=1 


Functions Between Finite Sets 


Suppose A and B are non-empty sets with |A| = m and |B| =n. We can now 
calculate the number of functions from A to B in the following way. Form the 
Cartesian product of m copies of B, which we may write B”. By Exercise 4.8.7, 
|B” | =n". Furthermore, any ordered m-tuple from B” can be thought of as a 
function f : A > B, where the first coordinate of the m-tuple is f(a,), the second 
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coordinate is f(az), and so on. Since distinct m-tuples from B” represent distinct 
functions from A to B, we have proved the following. 


Theorem 4.8.8 If A and B are non-empty sets with |A| = m and |B| =n, then 
there are n” distinct functions from A to B. 


Here is an informal way to think of Theorem 4.8.8. Let A = N,, and imagine 
we have m empty slots numbered 1, 2,...,m. Into each slot we insert precisely 
one element of B, where any element of B may be used repeatedly if we like. 
Filling the slots in this way is equivalent to defining a function f : N,, — B. There 
are n choices of element to place in the first slot. For each of these ways to fill the 
first slot, there are n ways to fill the second slot, so that there are n* ways to fill 
the first two slots. Continuing, we have that there are n” ways to fill all m slots. 
Theorem 4.8.8 motivates the notation B4 to represent the set of all functions from 
A to B. 

What if we must fill each of the n slots with one of the m elements of B, but 
we are not allowed to use any element of B more than once? How many ways can 
this be done? First note that if such a task can be done at all, it must be that m < n. 
We have a choice of n objects for the first slot. Then, regardless of which element 
of B was placed in the first slot, there are n — 1 possible elements for the second 
slot, and so on. Thus the number of ways to fill the m slots with distinct elements 
of Bisn(n —1)(n—2)...(n —m +1). We write this as 

POL Gs Gr On eee (4.23) 
(n —m)! 
which is called the number of permutations of n objects taken m at a time. Filling 
the slots with elements of B so that there is no repetition is the same as constructing 
a function from B to N,, that is one-to-one. With this, we have an informal proof 
of the following. A rigorous proof should be done by induction. 


EXERCISE 4.8.9 Let A and Bbe non-empty sets with |A| = m <n = |B|. Then 
there are P(n, m) distinct one-to-one functions from A to B.!8 


If Eq. (4.23) is to be taken as meaningful for m = 0, it implies that there is one 
way to arrange zero of n objects. It also suggests that there exists a unique (empty) 
function from the empty set to any finite set. 

If U is a finite univeral set and A is a subset of U, then Exercise 4.6.10 implies 
the complement rule, which says 


|A| = |U| — |AS| (4.24) 


Sometimes the task of determining the cardinality of a set is more easily done by 
exploiting Eq. (4.24). 


18 Induct on m 1, 
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EXERCISE 4.8.10 Let Aand Bbe non-empty sets with |A| = m < n = |B|. How 
many distinct functions from A to B are not one-to-one? 


Suppose A is a set with n elements and (a1, ..., a,) is an element of A”, where 
all coordinates of the n-tuple are distinct. We may think of such an n-tuple as a 
way of ordering the elements of A. 


EXERCISE 4.8.11 If A is a set with n elements, in how many distinct ways can 
its elements be ordered? 


Applications 


How many ways are there to put together a meal from all the cafeteria offerings? 
To arrange your CD collection on your dresser? To name a baby boy from a list of 
family preferences? Results from this section can help us answer some questions 
of the sort “How many ways are there to...?” 


Example 4.8.12 Your university cafeteria has the following menu for today’s 
lunch. 


Meats : Meatloaf, Chicken, Fish sticks 
Starchy vegetables : Potatoes, Rice, Corn, Pasta 
Green vegetables : Beans, Broccoli, Salad, Spinach 
Breads : Rolls, Corn bread 
Desserts : Chocolate cake, Pudding 


If you choose one item from each category of the menu, how many different meals 
could you put together? 


Solution If we let A; be the set of meat offerings, Az the set of starchy vegeta- 
bles, and so on, then each potential complete meal is an element of [pea Ax. By 
Exercise 4.8.7, there are 3 x 4x 4x 2 x 2 = 192 possible meals. Ml 


In Example 4.8.12, counting the number of possible meals can be visualized in 
the following way. We have five empty slots to fill on our plate, the first to be filled 
with a choice of meat, the second with a choice of starchy vegetable, and so on. 
Furthermore, the number of choices available to fill a particular slot is unaffected 
by the way any of the previous slots have been filled or the way the remaining 
slots will be filled. Multiplying the number of ways to fill each slot illustrates the 
multiplication rule. If an n-step process is such that the kth step can be done in ax 
ways, and the number of ways each step can be done is unaffected by the choice 
made for any other step, then the total number of ways to perform all n steps is 

kel ak. 
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EXERCISE 4.8.13 How many meals can be put together from the menu in 
Example 4.8.12 if dessert is optional? 


Here is an illustration of the practical use of Theorem 4.8.8. 


Example 4.8.14 Suppose you toss a coin 10 times and observe the sequence of 
outcomes of heads (H) or tails (T). Each possible outcome of the 10 tossings can 
be written as a 10-tuple of the form (H,H,H,T,H,H,T,T,H,T), each of which we can 
visualize as a function from Nio to {H, T}. By Theorem 4.8.8, there are 210 — 1024 
such functions. 


EXERCISE 4.8.15 A password to your computer account must be precisely 
seven alphanumeric characters, that is, a sequence of seven characters taken from 
the 26 letters of the alphabet and the digits 0-9. How many distinct passwords 
may be formed? 


Exercise 4.8.9 will help you answer the next question. 


EXERCISE 4.8.16 The Fitzpatricks are expecting a baby boy any day now. 
Family members have strong opinions about what the child will be named. Sug- 
gested names are William, Warren, Benjamin, Fitzhugh, Chancellor, Millhouse, 
and Nebuchadnezzar. The parents want their son to have three distinct given 
names, such as William Fitzhugh Millhouse Fitzpatrick. How many potential 
names are there for the Fitzpatrick son? 


EXERCISE 4.8.17 How many ways are there to arrange your collection of eight 
CDs in a row on top of your dresser? 


EXERCISE 4.8.18 In California, a standard license plate consists of a digit 1-9, 
followed by three letters, followed by three more digits 0-9, such as 3AAG045. If 
all constructions of this form are considered usable, how many standard license 
plates can California issue? 


EXERCISE 4.8.19 Twenty horses run a race in which prizes are given for win, 
y Pp 8g 

place, and show (first, second, and third places). How many outcomes are there 

for the race? 


Example 4.8.20 Let a word be defined as any arrangement of letters from the 
alphabet, so that, for example EJEIO is a word. If the vowels are {A, E, I, O, U}, 
how many four-letter words contain at least one vowel? 


Solution Let U be the set of all four-letter words, and let A be the set of four- 
letter words that contain at least one vowel. Then |U| = 264 and |AC| = 21%. By 
the complement rule, |A| = 264-214. 
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EXERCISE 4.8.21. For security purposes, a password like that in Exercise 4.8.15 
must contain at least one numeric character. How many such passwords are 
there? 


The remaining exercises will require you to apply all the principles of this 
section. 


EXERCISE 4.8.22 A single digit on an old-fashioned calcu- ae 
lator is created from a configuration of seven light-emitting 3] 
diodes, as in the sketch here. How many distinct symbols can = 
be displayed with these seven lights? 


EXERCISE 4.8.23 Ina first grade class with 20 students, every student sends a 
Valentine’s Day card to every other student. How many Valentine’s Day cards are 
sent among the students? 


EXERCISE 4.8.24 You live in Atlanta and must fly for work to Washington DC, 
Chicago, and Dallas, then back to Atlanta. If you may visit these cities in any order, 
how many different ways can you plan your trip? 


EXERCISE 4.8.25 Six houses in a row are each to be painted with one of the 
colors red, blue, green, and yellow. In how many ways may the houses be painted 
so that no two adjacent houses are the same color? 


8. e ballot for your club’s officers lists 2 candidates for presi- 
EXERCISE 4.8.26 The ballot f y lub’s offi lists 2 candidates for presi 
dent, 4 candidates for vice-president, and 6 candidates for secretary. In how many 
different ways can you mark your ballot if: 

(a) You vote for exactly one person for each office? 

(b) You may choose to leave any item blank? 

EXERCISE 4.8.27 The postal delivery person has five different packages, all to 
be delivered to the same block, where there are 10 houses. How many different 
ways can he or she distribute the packages among the houses, provided: 

(a) There are no restrictions on how the packages may be distributed? 


(b) No house may receive more than one package? 


EXERCISE 4.8.28 Six friends go out for dinner, and each orders something 
different. The server brings 4 of the 6 dishes to the table, but he does not remember 
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who ordered what. In how many different ways could he set these 4 dishes down 
among the 6 people at the table? 


Combinations and Partitions 


In this section, we build on this result to determine the number of ways we may 
select m out of n objects, where we do not distinguish between different arrange- 
ments. Then we generalize this to determine how many ways we may partition a 
given set into distinguishable subsets of given sizes. 


Combinations 


Given a set of n objects and an integer 0 < m <n, we can calculate the number of 
ways we may choose m of the n objects, with no order taken into consideration. 
This is precisely the same as asking how many distinct subsets containing m objects 
may be formed from a set with n objects. We call this the number of combinations 
of n objects taken m at a time, and we denote it C(n, m). An alternate notation is 
("), which is read “n choose m.” 

A natural way to derive the formula for (”) is with an equivalence relation. 
Let S be the set of all the P(n, m) permutations of the n objects taken m at a 
time. Define two permutations in S to be equivalent provided they contain pre- 
cisely the same elements of the set, regardless of the order of these elements. 
Clearly, this is an equivalence relation, so that S is partitioned into equivalence 
classes, where any given class contains all the possible arrangements of a particular 
m-element subset of the n objects. The number of equivalence classes is therefore 
C(n, m), the expression whose formula we are trying to determine. Notice that 
each equivalence class contains precisely m! elements. Since |S| = P(n,m), we 
have that m! x C(n, m) = P(n, m), so that 


C(n, m) = (") yD) (4.25) 


m! m!\(n — m)! 
We have therefore proved the following. 


Theorem 4.9.1 Suppose A is a finite set of cardinality n, and let 0 <m <n. 
Then the number of subsets of A of cardinality m is given by Eq. (4.25). 


Notice that C(n, m) = C(n,n — m),so the number of ways to choose m out of n 
objects is the same as the number of ways to choose n — m of the objects. Notice 
also that C(n, 0) = 1 is meaningful in that there is precisely one zero-element 
subset of a given set. 
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Combinations have a number of interesting mathematical features, and they 
pop up in a variety of mathematical settings. One useful result is the following. 


EXERCISE 4.9.2 Suppose n > 1 and1<k <n.Then 


Ge i) r @ 7 (" ‘y" (4.26) 


It is arguable that people who work in combinatorics enjoy their proofs as 
much as any group of mathematicians. The reason is that there are often several 
seemingly unrelated ways to argue the same result, and to discover a new way to 
prove a combinatorial theorem is a particularly satisfying accomplishment. One 
such combinatorial argument for Exercise 4.9.2 goes like this. 


Proof of Exercise 4.9.2. Let n be a nonnegative integer, and suppose A isa set of 
n + 1 objects. Then the number of k-element subsets of A is given by the right-hand 
side of Eq. (4.26). Now let a be any element of A. The k-element subsets of A are 
of two types—those that contain a and those that do not. To create a k-element 
subset that contains a, we need only choose k — 1 of the n objects other than a. 


There are (,”,) ways to do this. To create a k-element subset that does not contain 


a, we must choose k of the n objects other than a. There are () ways to do this. 


Thus another way to count the number of k-element subsets of A is given by the 
left-hand side of Eq. (4.26). 


4.9.2 Partitioning a Set 


Choosing an m-element subset of an n-element set is the same as partitioning the 
set into two subsets, one of size m and one of size n — m. We can generalize this 
process in the following way. 

Suppose A has n elements and we want to partition A into p subsets whose 
cardinalities are nj, n2,...,p,respectively, where eS ny = nandnone of then; 
is zero. How many ways can we do it? Imagine choosing n1 of the n objects to 
comprise the first subset, then choosing nz of the remaining n — n1 objects to 
comprise the second subset, then n3 of the remaining n — ny; — n2, and so on. By 
the multiplication rule, then we have 


(ei) | ener a) 


n!} (n —n})! (n —n1 —n2)! np! 


mi(n—ny)! n2!(n—1n, —n2)! n3!(n—1n, —n2—7n3)! np!0! 


op oe (4.27) 


ny!nq!n3! - np! 


19 Induction is not necessary, just some straightforward algebraic manipulation. 
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Theorem 4.9.3 Suppose |A| =n, and let n1,...,np be positive integers such 
that )°?_, 2p» =n. Then the number of ways to partition A into distinguishable 
subsets of sizes n1,...,p is given by 


! 
( 7 ) fo ea (4.28) 
N1,...,Np ny!ng!...np! 
Theorem 4.9.3 says that the subsets of A are distinguishable. If all of the subset 
sizesnj,..., pare distinct, then all of the p subsets of A are clearly distinguishable. 


However, if nj =; for some distinct i and j, Eq. (4.28) distinguishes between 
two partitions that differ only in that the elements of the ith and jth subsets are 
interchanged. For example, if A = {1, 2,3, 4, 5}, then the partition 


A, = {2} Az=({1,4} A3 = {3,5} (4.29) 
is considered to be different from 
A ={2} Ar = {3,5} A3 = {1,4} (4.30) 


As an example, consider that we are going to split a group of 12 children into four 
groups of three. Equation (4.28) assumes that each group has a unique feature 
associated with it, so that swapping two entire groups would be considered a 
different outcome. 


Applications 


Example 4.9.4 The State Lottery Commission draws six balls at random from a 
bin of 51 numbered balls, where order does not matter in determining if someone 
wins. There are (.) ways to draw these balls. Thus if you buy one lottery ticket, 


your chances of winning are one in about 18 million. 


Example 4.9.5 Your club of 50 members is going to choose a president, vice- 
president, secretary, a banquet committee of four, and a service project committee 
of five. No one may serve in more than one capacity. This process can be thought 
of as partitioning a 50-element set into subsets of size 1, 1, 1, 4, 5, and 38. The 
number of ways this can be done is 


oa (ugh eV area | oss = ae = a whole bunch (4.31) 
T/\LT/JAIS\ 4/5 111111415138! 


Let’s put together all the counting techniques we have developed to answer 
some more complex questions. 
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Example 4.9.6 Suppose you toss a coin 10 times and observe the sequence of 
outcomes. From Example 4.8.14, we know that there are 2!° possible outcomes. 
We want to count the number of these outcomes that have exactly 3 heads. Imagine 
we have the numbers in Njo = {1,..., 10} written on slips of paper. An outcome 
of the 10 coin tosses with exactly 3 heads can be uniquely specified by taking a 
3-element subset of Ni9 to represent which of the 10 slots are heads. There are 
C(10, 3) = (10-9-8)/(3-2-1) =120 such choices. 


Example 4.9.7 A box of light bulbs contains 100 bulbs, 3 of which are defective. 
If we choose 10 bulbs from the box, we want to calculate how many ways there 
are to get precisely one of the defective bulbs. Such a choice of 10 bulbs consists 
of 9 of the 97 good bulbs chosen without regard to order, and 1 of the defective 
bulbs chosen from the 3 bulbs. Multiplying these, we have C(97, 9) x C(3, 1) ways 
of choosing the bulbs in which precisely one is defective. 


Exercise 4.6.7 says thatif A and Bare disjoint, then|A U B| = |A| + |B]. Applied 
to counting the number of ways of performing a task, we call this theorem the 
sum rule. If counting the number of ways of performing a task must be broken up 
into disjoint cases, the sum rule says simply that the calculations for the different 
cases are added to produce the total number of ways of performing the task. 


Example 4.9.8 A license plate consists of up to seven characters, taken from 
the 26 letters and the digits 0-9. If there is room for spaces, they are not counted 
in the arrangement; for example, there is no distinction between | TANGENT 
and| TAN GENT |. We count the total number of plates that can be issued. There 
are different cases based on the number of characters used. The number of one- 
character license plates is 36; the number of two-character license plates is 367, 
and so on. Thus the total number of license plates is 


36 +367 +36 + --- +367 = 36(1 +--- + 36) = 36(3251) = 3867-1) 


Example 4.9.9 Your club of 30 women and 20 men is going to choose a com- 
mittee of five. We calculate the number of ways to choose the committee if it must 
include at least one man. The number of ways to choose the committee with no 
restrictions is C(50,5), and the number of ways to choose the committee with 
no men is C(30, 5). By the complement rule, the number of ways to choose the 
committee where there is at least one man is C(50,5) — C(30,5). 


EXERCISE 4.9.10 Pizza Peddler offers 12 different toppings and is having a 
special on their two-topping pizzas. How many distinct ways are there to order 
one of their special pizzas? 


EXERCISE 4.9.11 You overheard that a TRUE/FALSE test of 10 questions 
had 7 TRUE and 3 FALSE answers. If you take the test with this knowledge, how 
many ways are there to fill out the answer sheet? 
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EXERCISE 4.9.12 How many ways are there to arrange all the letters in the 
word MISSISSIPPI?” 


EXERCISE 4.9.13 Your office is 10 blocks east and 7 blocks north of your apart- 
ment, in a section of the city where the streets run uninterruptedly north-south and 
east-west. Every day you go to work by walking along these streets the 17 blocks, 
always going either east or north. How many distinct paths of this sort are there 
from your apartment to your office? 


EXERCISE 4.9.14 When you walk home from the office (as in Exercise 4.9.13), 
you always like to stop at the ice cream parlor located on the corner two blocks 
south and five blocks west of the office. How many distinct paths back home are 
there if you always stop for ice cream? 


EXERCISE 4.9.15 Your club consists of 50 members, 22 men and 28 women. 
A committee of 5 is to be chosen, which must contain 2 men and 3 women. How 
many distinct committees can be chosen? 


EXERCISE 4.9.16 A jury of 12 and 2 alternates is chosen from 30 people 
summoned for jury duty. How many ways can this be done? 


EXERCISE 4.9.17 Your club of 30 women and 20 men is going to choose a 
committee of five, which must not consist of five people of the same gender. How 
many ways can it be done? 


EXERCISE 4.9.18 The State Lottery Commission (Example 4.9.4) is consider- 
ing including the number 52 in its drawing, so that the game is played by drawing 
six numbers from Ns). By what percentage is the number of possible outcomes 
increased? 


EXERCISE 4.9.19 In a room with eight people, everyone shakes hands with 
everyone else exactly once. How many handshakes take place? 


EXERCISE 4.9.20 Your freshman seminar has 12 people in it, and your 
professor decides to have several dinner parties for 4 of you at a time. If she 
wants to have a party for every possible group of 4 students from the class, how 
many parties will she have? 


EXERCISE 4.9.21 You have a penn , nickel, dime, quarter, and half dollar, and 
penny. q 

you are going to distribute them among your 12 nieces and nephews, making sure 

that no child gets more than one coin. In how many ways can this be done? 


20 Generalize Example 4.9.6. 
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EXERCISE 4.9.22 You have 5 quarters, and you are going to distribute them 
among your 12 nieces and nephews, making sure that no child gets more than one 
coin. In how many ways can this be done? 


EXERCISE 4.9.23 Ina box of 50 light bulbs, 5 are known to be defective. You 
take asample of 3 bulbs from the box. How many different samples contain exactly 
one defective bulb? 


EXERCISE 4.9.24 Andrew, Bethany, Chigger, and Delusia are going to play 
spades, where all 52 cards are dealt out, 13 to each player. How many ways can 
the cards be dealt? 


EXERCISE 4.9.25 You have invited five friends over for dinner, none of whom 
gave a firm commitment to come. How many different ways can attendance at the 
dinner turn out? 


EXERCISE 4.9.26 Your first-year seminar class of 16 students is going to break 
up into four discussion groups of four students each. The groups are distinguish- 
able, because each group will discuss a different question. How many ways can 
this be done? 


EXERCISE 4.9.27 In the previous exercise, suppose the discussion groups are 
now indistinguishable because each group will discuss the same question. How 
many ways can this be done? 


EXERCISE 4.9.28 Suppose you draw 5 cards from a standard deck of 52 cards. 
Pppose y 

How many such draws have: 

(a) At least one ace? 

(b) No face cards? 

(c) At least one face card? 

(d) All cards of the same suit? 

EXERCISE 4.9.29 A four-letter “word” is to be formed from the letters in the 

set {A, B, C, D, E, F, G}. Repetition of letters is allowed. How many such words 

contain: 

(a) The letter E? 

(b) A vowel? 


EXERCISE 4.9.30 Rework the previous exercise under the assumption that 
repetition of letters is not allowed. 
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EXERCISE 4.9.31 The US. Senate consists of 100 senators, with 2 from each 
state. In how many ways may a committee of 5 senators be formed so that no 
2 senators are from the same state? 


EXERCISE 4.9.32 You have 12 nieces and nephews, 3 of whom are kids of your 
pyromaniac brother Torch. If you distribute 5 quarters among the 12 kids so that 
no child receives more than one quarter, how many such distributions have at 
least one of Torch’s kids receiving a quarter? 


EXERCISE 4.9.33 You have 4 novels, 6 comic books, and 9 textbooks on your 
bookshelf. In how many different ways can you arrange them if you want the 
books of each type to be grouped together? 


4.10 The Binomial Theorem 


The appearance of combinations in a variety of settings reveals some nice ties 
between seemingly unrelated mathematical ideas. First, there is a nice way to 
visualize Exercise 4.9.2 in Pascal’s triangle (Fig. 4.8). To construct Pascal’s triangle, 
we first extend the definition of C(n, k) to allow for k < 0 and k > n. In both of 
these cases we define C(n, k) = 0. 


EXERCISE 4.10.1 Verify that Eq. (4.26) holds fork <0 andk >n+1. 


To construct Pascal’s triangle, we begin with a row that corresponds to com- 
binations of the form C(0,k), which we will therefore call row zero. Because 
C(0, 0) = 1 and C(0, k) = 0 for all nonzero values of k, we construct row 0 to have 
a single 1 entry, with Os elsewhere. 

The next row corresponds to n = 1, and so on, and each entry in a row is 
determined by adding the two entries diagonally above it in the previous row. 


0 0 0 0 
NE 


n=0 
, Poe ee GN wees 
n= 
TT Sd SPs 
n=2 0 2 0 
: Pe Sol 
n= 
j LISI TO 
n= 
LN SBS LN LN OOS 
n=5 0 1 5 10 10 5 1 0 
Lo Nel Neel Nh Ne Ne NNN SS 
n=6 0 1 6 15 20 15 6 1 0 


Figure 4.8 Pascal’s triangle. 
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By Eq. (4.26), it follows that the entries in row n are C(n, k), where the diagonal 
column of 1s headed downward left and downward right correspond to k = 0 
and k = n, respectively. 

Here’s an important place where the entries in Pascal’s triangle appear. Sup- 
pose you want to expand the expression (a + b)” for some nonnegative integer n. 
Rather than multiply out (a + b)(a+ b)--- (a+b) using the extended distributive 
property, there is a much easier way to see what the terms are, and it involves 
C(n, k) in the coefficients. 


Theorem 4.10.2 (Binomial Theorem). Let a and b be nonzero real numbers, 
and let n be a nonnegative integer. Then 


n 


n n 
) n—k pk » ( n ) n—k pk 
= a b= a b 
k=0 (; k=0 Bes 


The only reason we do not allow a or b to be zero in Theorem 4.10.2 is that 
0° is not defined, so Eq. (4.32) would produce some undefined terms. If either a 
or b is zero, the expansion of (a + b)” is not particularly interesting. Thus we omit 
it. Using the entries from Pascal’s triangle, we have 


(4.32) 


(a + b)® = 1a°b° =1 


(a+b)! = 1a'b® +10! =a+b 
(4.33) 
(a+b)? =a’ +2ab4+b? 


(a+b) =a + 3a°b + 3ab” +b’, ete. 


We can argue the binomial theorem in several ways using combinatorial style 
arguments that appeal to some of our previous counting techniques. One way is to 
note that applying the extended distributive property to (a+ b)(a+b)--- (a+b) 
is tantamount to creating a whole bunch of terms of a form like aaabbaab, where 
from each (a+ b) factor we select either a or b. Since each factor allows for two 
possible choices, there are 2” terms generated, and every one is of the form a”~*b* 
forsome0 < k < n.Todetermine the coefficient of a”—*b*, we must determine how 
many times the term a’~*b* appears in all the distribution of the multiplication. 
If k of the factors supply us with b and the remaining factors provide us with a, 
then the number of times a"~*b* appears in the grand sum is the same as the 
number of ways of choosing k of the n terms to provide us with b (or n — k of the 
terms to provide us with a). That is, the term a"~*b* is produced C(n, k) times in 
the distribution. 
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EXERCISE 4.10.3 Determine the following. 


(a) The coefficient of a2b° in (a +b)! 
(b) The coefficient of a*b° in (a — b)’ 
(c) The coefficient of x* in (x + 2)° 
(d) The coefficient of x*y* in (3x — y)° 

We’re also going to provide an algebraic proof of Theorem 4.10.2. The tech- 
nique we will use in making the inductive step contains some manipulation of 
summations that can really come in handy in some of your other coursework, 
especially, for example, differential equations and complex analysis, where you 
must resort to finding what is called a series solution to a problem. Watch how we 


pull off some first and last terms from the summations and then realign the terms 
by changing the counter variable. 


Proof of Theorem 4.10.2. Let a and b be nonzero real numbers. 


(J1) Forn =0, (a+b)° = 1= C(0, 0)a°b", so Eq. (4.32) is true for n = 0. 
(J2) Suppose n > 0 and that Eq. (4.32) is true for n. Then 


(a+b)"*! = (a+b)"(at+b) = b C(n, oa (a+b) 


i=0 


n n 
=) C(n, ia" b+ SY C(n, ia" bt! (4.34) 
i=0 i=0 


n n 
=o Gr, Daa Cas alt SID 
i=0 i=0 


To align the terms in the two summations, let j = i + 1 for the second summa- 
tion (that is,i = j — 1), and note that 0 <i <nisthesameas1 < j<n+1. 
Making this substitution first, then pulling off a few individual terms, we can 
continue Eq. (4.34) further in several steps. 


n n+1 
= ys Cn, ia" t)-ipi + y C(n, j— Da@+)-Jpi 
i=0 j=l 


a cn Ge (4.35 
=> Cin, aet)'b + > C(n, §— Ya *P-Joi OP) 
1 q=l 


+ C(n, O)a"t!—p? + Cn, n)a® bt! 
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Now C(n, 0) = C(n +1, 0) and C(n,n) = C(n+1,n +1). Also, we may let 
k =i = j and combine the two summations and apply Exercise 4.9.2 to have 


n 
= Yoica, b+ Cn, k —1)Ja@td-kpk 
k=1 
+Cin+1, O)a"t1—0p9 CW 1.44 1)a°pr*t 


n 


=o Cin+1,battD-*pk (4.36) 
k=1 
+ C(n +1, 0)a"t?-P + Can + 1,04 1)a°o"*! 
n+1 
= ye Cin +1, hat t)—-*pk 
k=0 


which verifies that Eq. (4.32) holds for n + 1. 


The algebraic proof of Theorem 4.10.2 means that we have provided an 
algebraic proof of the following. 


Corollary 4.10.4 For all nonnegative integers n, 


se (") =2" (4.37) 


k=0 


Proof. Let a = b = 1 in Eq. (4.32). 


Suppose A is a finite set with cardinality n. Corollary 4.10.4 implies that A 
has 2” subsets, for the left-hand side of Eq. (4.37) is the sum of the number of 
subsets of an n-element set with all possible numbers of elements. Another way to 
see the same result is to create a one-to-one correspondence between the family 
of all subsets of A and the collection of the 2” functions from A to {0, 1}. If Aj is 
a given subset of A, we may define 


1, ifaeA 
fla) = } 


(4.38) 
0, ifa ¢ Aj 


Every distinct subset of A generates a distinct function, and by Theorem 4.8.8, 
there are 2” functions from A to {0, 1}. 


EXERCISE 4.10.5 Use your imagination to generalize the binomial theorem, 
so that you can determine the following. 


(a) The coefficient of a2b*c? in (a + b+ c)® 
(b) The coefficient of ab* in (a + b + c)® 
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(c) The coefficient of ab>c* in (a — b+ c)® 

(d) The coefficient of a*b* in (a — b — c)® 

(e) The coefficient of a*b? in (a +b+ 5)’ 

(f) The coefficient of ab* in (a + 2b + 3)8 

(g) The coefficient of a7b*c? in (2a + b — 3c + 5)!° 
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5.1 


The Real Numbers 


Let’s look at some defining characteristics of the area of mathematics we call 
analysis. Given a set S, its elements might be endowed with a measure of size. 
The size, or norm, of an element x is typically denoted |x|, like absolute value 
on the real numbers, or perhaps ||x||. The norm of an element will always be a 
nonnegative real number, so that a norm is really just a function from S into the 
real numbers that has three defining characteristics. You have already run into 
these in Section 2.3. They are properties N1—N2 on page 54 and property N3 on 
page 55. 

Measuring the sizes of elements is only one type of structure that can be placed 
on a set that puts it squarely in the field of analysis, but some notion of measure 
with nonnegative real numbers is characteristic of structures in analysis. For exam- 
ple, some structures do not have a norm but have a way of measuring some idea of 
distance between elements. Such a measure of distance is called a metric. Whether 
S is endowed with a norm or a metric, the measure it represents is inextricably 
tied to the real numbers. So the set of real numbers is at the heart of analysis, 
and one could argue that no analysis into any structure except the real numbers 
should be undertaken until one understands the axioms and fundamental results 
of the theory of real numbers. 

Be that as it may, a norm or metric on S, if one exists, lends itself to much 
fruitful study: Sequences and their convergence, continuity, and calculus are but 
a few. In this chapter, we address more advanced properties of the real numbers 
that arise out of the assumptions from Chapter 0, in particular A20. All these 
properties are important not only because they apply to the real numbers, but 
also because they are typical of properties of many other structures in analysis. In 
this chapter and the one to follow, all sets are assumed to be subsets of the real 
numbers unless specified otherwise. 


The Least Upper Bound Axiom 


The least upper bound (LUB) axiom is a standard axiom of the real numbers, 
endowing it with some of its familiar features. For example, the way we visualize 
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the real numbers as a smooth number line with no holes is due to the LUB axiom. 
Two other characteristics of the real numbers are logically equivalent to the LUB 
axiom. One is called the Nested Interval Property (NIP) and has to do with what 
you get when you intersect a whole bunch of real number intervals that have 
certain properties. The other is called completeness and has to do with the way 
certain sequences behave when the terms get close to each other. It is possible 
to assume any one of the LUB axiom, the NIP, or completeness and derive the 
other two as theorems. Assuming the LUB property as an axiom is probably most 
common, so we choose that route. In this chapter, we will prove the NIP from 
the LUB axiom and completeness from the NIP, and look into the converses of 
these theorems to show that all three are logically equivalent. In this section we 
explore the LUB property in some depth to get a feel for what it means, and we 
derive some immediately important implications. First, let’s present a definition. 


Definition 5.1.1 A set A is said to be bounded from above if there exists M, 
such that a < M, for all a A. A is said to be be bounded from below if there 
exists Mz such that a > Mp) for all a € A. A is said to be bounded if there exists 
M > Osuch that |a| < M forallae A. 


The first exercise appears to be a very obvious result, but it must be demon- 
strated in terms of Definition 5.1.1. Proving the => direction is easier. For if A 
is bounded, then the guaranteed M > 0 such that |a| < M for all a € A should 
clearly suggest values of M1 and M2. However, in proving the < direction, you 
must use the existence of M; and M2 to create a single, positive value of M such 
that —-M <a< MforallacA. 


EXERCISE 5.1.2 A set of real numbers is bounded if and only if it is bounded 
from above and below.! 


Least Upper Bounds 


Suppose A is non-empty and bounded from above. Among all possible upper 
bounds for A, we would call L a least upper bound for A if L is an upper bound 
for A with the additional property that L < N for every upper bound N. That is, 
among all upper bounds for A, none is any smaller than L. The LUB axiom says 
that every non-empty set of real numbers that is bounded from above has an LUB 
that is a real number. We state it again here for the sake of reference. 


(A20) Least upper bound property: If A is a non-empty subset of real numbers 
that is bounded from above, then there exists a real number L with the 
following properties: 


(L1) For every a € A, we have that a < L, and 
(L2) If N is any upper bound for A, it must be that N > L. 


1 T prove <, let M = |M,|+|M2| + 1. Use Exercise 2.3.7. 
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EXERCISE 5.1.3 The least upper bound of a set of real numbers is unique. 


In this chapter we will make extensive use of interval notation to represent 
sets of real numbers. 


(a,b) ={xeR:ia<x<)} (5.1) 
[a,b] ={x €R:a<x<b} (5.2) 
(a, +00) = {x ER: x>a} (5.3) 
(—c0, a] = {x ER: x <a} (5.4) 


Intervals of the form (5.1) and (5.3) are called open intervals, and those of the 
form (5.2) and (5.4) are called closed intervals. The motivation for these terms will 
become clear in Section 5.3 where we discuss open and closed sets in general. 


Example 5.1.4 Suppose a < b. Show that b is the LUB of the interval (a, b). 


Solution If x € (a, b), then x < b, so that b has property L1. To show that b has 
property L2, let N < bbe any real number. We show that N is not an upper bound 
of (a, b). If N < a, then by Exercise 2.2.9, 


N<a<(a+b)/2 <b 


so that (a + b)/2 is an element of (a, b) that is greater than N. On the other hand, 
if N > a, then since N < b also, we have 


a<N<(N+)D)/2 <b 


Thus (N + b)/2 is an element of (a, b) that is strictly greater than N. In either case, 
N is not an upper bound of (a, b), and b therefore has property L2. 


Least upper bounds can be defined in terms of an alternate pair of properties, 
which we will call M1—M2. In some situations, showing a particular number is the 
LUB of a set is more easily done by showing that it has these alternate properties. 
Your task (Exercise 5.1.6) will be to show that properties L1-L2 are equivalent to 
M1-M2. But first, let’s state the equivalent form as a theorem to suggest a way of 
attacking its proof. 


Theorem 5.1.5 Given a set A, L is the LUB for A if and only if the following 
two conditions hold. 


(M1) For every € > 0 (no matter how large), (L, L + €)N A is empty. 
(M2) For every « > 0 (no matter how small), there exists ae AN (L —e«, L]. 


The Greek letter € (epsilon) has been used so much in analysis to represent an 
arbitrary positive real number that it has come to have a personality all its own. 
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Although ¢ is not generally thought to be of any particular size, it is usually present 
in theorems and proofs because smaller values of € represent the primary obstacle 
to overcome in concocting the proof. Theorem 5.1.5 is a natural way to visualize 
the LUB property in terms of how intervals to the left and right of L intersect the 
set A. Every interval of the form (L, L + €), no matter how large « is, must not 
contain any elements of A. Also, every interval of the form (L — ¢, L], no matter 
how small € is, must contain some element of A. 

Here is a suggestion about how to tackle the proof of Theorem 5.1.5. Given a 
set A and areal number L, the theorem says (L1 A L2) < (M1 A M2). For the > 
direction, Exercises 1.2.18(f) and (i) imply that 


(L1 A L2) > (M1 A M2) & [(L1 A L2) > MIA [(L1 A L2) > M2] 


& [(L2 A =M1) > -L1] A [(L1 A =M2) > =L2] 
(5.5) 


The statement ~M1 — -—L1 is at least as strong as (L2 A =M1) > -L1. So to 
prove the — direction of Theorem 5.1.5, you will want to show L1 > M1 by 
contrapositive and then show (L1 A L2) > M2 by showing (L1 A —=M2) > -=L2. 
Then you can prove < by showing ~L1 — —M1 and -L2 — —M2. 


EXERCISE 5.1.6 Prove Theorem 5.1.5. 


Example 5.1.7 Show that the set {1} has LUB 1 by appealing to properties 
M1-M2. 


Solution Pick « > 0. Because (1, 1 + €) M {1} is empty, property M1 holds. Also, 
because 1 € (1 — «, 1] N {1}, property M2 holds. Thus 1 is the LUB of {1}. 


EXERCISE 5.1.8 Show that b is the LUB of the interval (a, b) by appealing to 
properties M1—M2. (An identical argument will work for [a, b].) 


Greatest Lower Bounds 


Now let’s turn the LUB property upside down and discuss bounds from below. 
We do not have to make any new assumptions concerning the existence of the 
greatest lower bound of a set, for we can derive results from the LUB property 
by flipping a set upside down, so to speak. 


Definition 5.1.9 A real number G is said to be a greatest lower bound (GLB) 
for a set A if G has the following properties: 


(G1) For every a € A, we have that a > G, and 
(G2) If N is any lower bound for A, it must be that NV < G. 
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EXERCISE 5.1.10 The greatest lower bound of a set is unique. 
The next exercise will serve as a helpful lemma for the exercise to follow. 


EXERCISE 5.1.11 Suppose A is a set that is bounded from below by M, and let 
B= {—a:a€ A}. Then Bis bounded from above by —M. 


EXERCISE 5.1.12 IfAisa non-empty set of real numbers that is bounded from 
below, then A has a GLB.? 


EXERCISE 5.1.13 State and prove a theorem analogous to Theorem 5.1.5 for 
the GLB of a set. 


EXERCISE 5.1.14 Let A be a subset of the real numbers, and define the set 
B= {-—a:aé€ A}. Then A has LUB L if and only if B has GLB mY bre 


The Archimedean Property 


The LUB property implies that the positive integers are unbounded from above, 
which is the claim of the next exercise. To prove it, suppose the result is false, claim 
the existence of a LUB, and then exploit properties M1—M2 for a strategically 
chosen value of € to produce a contradiction. 


EXERCISE 5.2.1 For all positive real numbers x, there exists a positive integer 
n such that n > x. 


Exercise 5.2.1 has a logically equivalent form called the Archimedean pro- 
perty. We have already pointed out briefly that the ancient Greeks were very 
sophisticated mathematically. One idea they used involved what we would call an 
infinitesimal number. Different from zero, an infinitesimal was considered to be 
smaller than every positive number. One salient property of an infinitesimal is that 
you can add it to itself any finite number of times and still have an infinitesimal. 
The next result says that there are no infinitesimals in the real numbers. It is named 
after Archimedes, who argued against the use of infinitesimals. 


EXERCISE 5.2.2 [Archimedean Property] For all positive real numbers ¢ and 
r, there exists a positive integer n such that ne > r.4 


Thus no matter how small a positive number € might be, and no matter how 
large a positive number r is, you can add € to itself some n number of times 


2 If B from Exercise 5.1.11 has LUB L, you can show that —L is the GLB for A. 
3 You might be able to appeal to your proof of Exercise 5.1.12. 
4 Apply Exercise 5.2.1 to x = r/e. 
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to produce a sum that exceeds r, so that ¢€ is therefore not an infinitesimal. 
The Archimedean property often proves to be useful in a slightly different form, 
using the specific case r = 1. We state it as a corollary to Exercise 5.2.2. 


Corollary 5.2.3 For alle > 0, there exists a positive integer n such that 1/n < e. 
This alternate form of the Archimedean property says that numbers of the 


form 1/n can do a pretty mean limbo dance in the real numbers. No matter how 
low you set the e-bar, the reciprocal of some positive integer can wiggle under it. 


EXERCISE 5.2.4 Between any two real numbers a < b, there exists a nonzero 
rational number.>:% 7:8 


EXERCISE 5.2.5 Between any two real numbers a < b, there exists an irrational 
number.” 


Maximum and Minimum of Finite Sets 
The intervals (a, b) and [a, b] illustrate that the LUB and GLB of a set might or 
might not actually be in the set. If a set is finite, then the LUB and GLB will be 


elements of the set, and we will call them the maximum and minimum, respectively. 
The Archimedean property plays an important role in the proofs of these facts. 


EXERCISE 5.2.6 If S= {ay} _4 is a finite set of real numbers, then S is 
bounded.!? 


Which of the following statements is stronger? 

For every disease, there exists a medicine that cures that disease. (5.6) 
or 

There exists a medicine that cures every disease. (5.7) 
Statement (5.7) says that there exists a panacea, so that clearly (5.6) is true. But 
just because (5.6) is true, it might be because there are many different medications, 


each of which applies only to one disease. So a panacea does not necessarily exist, 
and (5.7) is not necessarily true. Thus (5.7) is stronger than (5.6). 


First start by assuming a > 0. Worry about the other cases later. 
Apply Corollary 5.2.3 to € = b—a. 

If 1/n < b—a, youcan apply Exercise 5.2.2 to a and 1/n. 

Use the WOP for the right m such that m/n > a. 

Apply the result of Exercise 5.2.4 to a/V/2 < b/V/2. 

Let M=1+ 322, lagl. 
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Now suppose we have a finite set {a,}?’_, in which each ax is positive. Then we 
may apply the Archimedean property to each a; separately to claim the existence 
of positive integers {n1,n2,...,p}, where 1/nxz < ax for every k. This is like say- 
ing every disease has a cure, because it says every a; has its own nx. But in the next 
exercise, we claim that there is a single positive integer n that satisfies 1/n < ax 
for all k. To find this panacea n, you will need to build it in terms of all the nx. 


EXERCISE 5.2.7. Let S= {az}P_4 be a finite set of positive real numbers. Then 
there exists a positive integer n such that 1/n < a, for all 1 < k < p.'! 


The Archimedean property says that numbers of the form 1/n are clustered 
around zero arbitrarily closely. If you zoom in to any interval (—e, €), no matter 
how small e might be, there will be some 1/n in this interval. You cannot zoom 
in close enough to find an interval around zero that is devoid of numbers of the 
form 1/n. In general, if it is possible to find some € > 0 such that the interval 
(L — ¢, L + €) contains no elements of a given set, we say the set can be bounded 
away from L. Exercise 5.2.7 says that a finite set of positive numbers can be 
bounded away from zero. We state this fact as a corollary. 


Corollary 5.2.8 A finite set of positive real numbers can be bounded away from 
zero. That is, if S = {ax} = is a finite set of positive real numbers, then there exists 
M > Osuch that q > M forall1 <k < p. 


If S is a finite set, then by Exercise 5.2.6, S is bounded. Thus by Exercise 5.1.2, 
it is bounded from above and below. By the LUB and GLB properties, $ has a 
LUB and GLB. In fact: 
Theorem 5.2.9 A finite set of real numbers contains its LUB and GLB. 


EXERCISE 5.2.10 Prove that the set in Theorem 5.2.9 contains its LUB. 
(The argument that it contains its GLB would be similar.)!* 


Definition 5.2.11 Let S be a finite set of real numbers. Then the LUB and 
GLB of S are called the maximum and minimum values of S, respectively, and 
are denoted max(S) and min(S). 


The maximum and minimum of a finite set will prove to be important in later 
sections. Here are two simple illustrations. First, suppose anyone at least 16 years 
of age can drive, anyone at least 18 can vote, and anyone at least 65 can collect 
Social Security. How can you guarantee that a chosen person can drive, vote, and 
collect Social Security? Certainly you would choose a person whose age is at least 


Letn = ys Nk. 
If L is the LUB of S and L ¢ S, then you ought to be able to use the set T = {L — ag}? _y and 
Corollary 5.2.8 to contradict the assumption that L is the LUB of S. 
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max{16, 18, 65}. For if we write M = max{16, 18, 65}, then a person of age M can 
drive, vote, and collect Social Security because M satisfies all three inequalities 
M > 16, M > 18,and M > 65. 

As another example, suppose every real number within €; distance of zero is 
in set A, and every real number within €2 of zero is in set B. That is, if |x| < «1, 
then x € A, and if |x| < €2, then x € B. If we let € = min{e1, €2}, we can be sure 
that every real number within ¢ of zero will be in AN B. For if |x| < €, then both 
|x| < €, and |x| < «2 are true,so that x € A andxe B. 

The terms max(S) and min(S) are often used when S is not finite in the event 
that S contains its LUB and GLB. 


Open and Closed Sets 


Research in the area of topology flourished more through the middle half of the 
twentieth century than in any other period. In these next few sections, we introduce 
some of the basic terminology that is commonly used in topology, but we apply 
them only to the real numbers. 


Definition 5.3.1 Given a real number a and « > 0, we define the ¢-neighbor- 
hood of aas 


N-(a) = (a-—€,ate={xEeR:ia-—e<x<ate={x ER: |x-a| <6} 


(5.8) 
and we say that the neighborhood has radius e. 
Notice if 0 < €; < 2, then Ne, (a) C Ne, (a). For if x € Ne, (a), then 
a—-@<a-E <x<at+ea<at+ea (5.9) 


so that x € N,, (a). 


Definition 5.3.2 A set A is called open provided for every a € A, there exists 
€ > Osuch that N-(a) C A. 


A set is open if every element is in some interval contained entirely within the 
set; that is, every point is, in a sense, insulated from the outside of the set by a 
neighborhood that is contained completely within the set. 


EXERCISE 5.3.3. Show that the intervals (a, +00) and (—oo, b) are open. 


What does it mean for a set A not to be open? 
=(A is open) & 7=(Va € A)(e > 0)(N-(A) © A) 
<> (da € A)(We > 0)(N E A) (5.10) 
<> (da € A)(We > 0)(Ax € Ne(a))(x € A) 
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Thus A is not open provided there is at least one point in A, for which every 
€-neighborhood contains some point not in A. Some point of A has points of AC 
in every €-neighborhood, no matter how small € might be. 


EXERCISE 5.3.4 Show that the singleton set {a} is not open. 
EXERCISE 5.3.5 Show that the set (a, b] is not open. 


Two other sets besides (a, +00) and (—oo, b) immediately present themselves 
as open sets. The empty set is open simply because otherwise there would have to 
exist some x in the empty set with some property, and that is a contradiction. The 
real numbers are an open set because for any real number a, we may let e = 1 and 
have N-(a) CR. 

Let’s address unions and intersections of open sets. If A and B are both open, 
do you suspect A U Bis open? What about A M B? In both cases the answer is yes, 
but when an arbitrary family of open sets is combined by union or intersection, 
the answer can change. 


EXERCISE 5.3.6 Let #= {A} be a family of open sets, and let {B,}{_, be a finite 
family of open sets. Then 


(a) U- A is open. 
(b) (a1 Be is open. 
By Exercise 5.3.6, it follows that an interval of the form (a, b) is open, for 


(a, b) = (—oo, b) N (a, +00). The intersection of an infinite family of open sets 
certainly could be open, but it might not be. 


EXERCISE 5.3.7 Show that the intersection of infinitely many open sets might 
not be open by showing (7°, (—1/n, 1/n) = (o}.)8 


If we were discussing doors or stores or minds, to say that one is closed would 
mean that it is not open. However, to say a set is closed does not mean it is 
not open. 


Definition 5.3.8 A set A is said to be closed provided A© is open. 


EXERCISE 5.3.9 The singleton set {a} is closed. 


EXERCISE 5.3.10 Ifa< b, then the interval [a, b] is closed. 


Show that zero is an element of every interval (—1/n, 1/n) and that any nonzero real number fails to 
be in some (—1/n, 1/n). 
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EXERCISE 5.3.11 Let F= {A} be a family of closed sets, and let {By}7_, be a 
finite family of closed sets. Then 


(a) (\A is closed.!* 
(b) Use; Bx is closed. 


By Exercises 5.3.9 and 5.3.11, it follows that any finite set is closed because it 
is a finite union of singleton sets. The integers are a good example of an infinite 
set that is closed, because Z© = Unez(n,n + 1). 


EXERCISE 5.3.12 Give an example of an infinite family F of closed sets such 
that UA is not closed. Prove.!> 


EXERCISE 5.3.13 Show that {1/n}°° | is not closed by showing its complement 
is not open. 


Lots of sets are neither open nor closed. For example, [a, b) is sometimes called 
a half-open or half-closed interval. Believe it or not, some sets are both open and 
closed. The empty set and the real numbers are both open and closed because 
they are both open and they are complementary. Theorem 5.3.14 uses the LUB 
axiom to show the important fact that these are the only two sets of real numbers 
that are both open and closed. The proof is a little intricate, so we provide it here. 
It is important for you to work your way through the details because it illustrates 
an important type of proof where we show that certain examples of something are 
the only ones that exist. Keep a pencil and paper handy to sketch some number 
lines and help you visualize the details. 


Theorem 5.3.14 The empty set and the real numbers are the only subsets of 
the real numbers that are both open and closed. 


Proof. We prove by contradiction. Suppose there exists a non-empty proper sub- 
set A of the real numbers that is both open and closed. Then A and A® are both 
open and non-empty. Pick any a, € A and aj € A©. Without any loss of generality, 
we may assume a; < a2. Let E = {ce > 0: N<(a) C A}; that is, E is the set of all 
€ > 0 such that the e-neighborhood of a, is contained entirely within A. Since A 
is open, E is non-empty. Because a2 € A©, E is bounded from above by az — a1. 
Thus we may apply the LUB property to E to conclude the existence of some €0, 
the LUB of E£. Thus N,,(a1) is the largest neighborhood of a; that is contained 
entirely within A. 

In order to arrive at a contradiction, we look at the end points of the interval 
(a1 — €0, 4 + €0). First we ask if it is possible that a1 + €o € AC. If so, then the fact 


14 See Exercise 3.3.8. 
15 You shouldn’t have to look too far. 
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that AC is open means there exists €; > 0 such that Ne, (a1 + €9) C A°. Because 
a, € A, we know that €; < eg. But then a; + €9 — €,/2 € A©. This contradicts 
the fact that N.,(a1) C A. Thus a; + €9 € A. By an identical argument applied to 
a, — €9, we have that a; — eg € A, so that [aj — €9,a, + €9] C A. 

Now let’s look again at the end points of [a1 — €o, a1 + €0]. Since A is open 
and a, — €9,a, +€9 € A, there exist €2,€3 > 0 such that N.,(a1 —€9) C A and 
Ne, (a1 + €0) C A. Letting eg = min{e2, €3}, we have that Ney+<,(a1) C A. This con- 
tradicts the fact that €o is the LUB of E£, for we have shown that ¢9 + €4 € E. Thus 
it is impossible that A and A© are both open and non-empty. 


Theorem 5.3.14 is important both in its own right and because it motivates 
the idea of connectedness of sets in topology. By Theorem 5.3.14, if A is a non- 
empty, proper, open subset of the real numbers, then A© is a non-empty, proper 
subset of the real numbers that is not open. Thus Theorem 5.3.14 implies the 
following. 


Corollary 5.3.15 The real numbers cannot be written as the disjoint union of 
two non-empty open sets. 


Another way to say this is that if A and B are open and disjoint, andif AU B = R, 
then either A = % or B = Y. This property gives us the imagery of the real number 
line as one connected piece, and as the proof reveals, follows from the LUB axiom. 
In the more abstract setting of topology, a set is said to be connected by definition 
if it cannot be written as the disjoint union of two non-empty, open sets. 


Interior, Exterior, Boundary, and Cluster Points 


Suppose A is a subset of the real numbers and x is any real number. We can 
characterize the relationship of x to A in two distinct ways. The first is by placing 
x in precisely one of the interior, exterior, or boundary of A. The other is by 
determining whether x is a cluster point of A, which has to do with whether 
elements of A are packed tightly around x. 


Interior, Exterior, and Boundary 
Given a set A and any real number x, precisely one of the following must be true. 
1. There exists « > 0 such that N.(x) C A, in which case we say that x is in the 


interior of A, written x € Int(A). 


2. There exists « > 0 such that Ne(x) € A, in which case we say that x is in the 
exterior of A, written x € Ext(A). 


3. For alle > 0, there exists aj, az € N¢(x) such that aj € A and ay € A©, in which 
case we say that x is on the boundary of A, written x € Bdy(A). 
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Here are some immediate observations. 

1. Int(A) C A. For if x € Int(A), then there is an e-neighborhood of x contained 
entirely within A, and x is in this neighborhood. Thus x ¢€ A. 

2. Ext(A) © A© by similar reasoning. 


3. Ifx € Bdy(A), then x might or might not be in A. However, if x € A, then every 
neighborhood of x contains elements of A©. Similarly, if x ¢ A, then every 
neighborhood of x contains elements of A. 


4, Int(A) = Ext(A°). 
5. Ext(A) = Int(A°). 
6. Bdy(A) = Bdy(A°). 
EXERCISE 5.4.1. Foreach of the following sets, determine the interior, exterior, 
and boundary. 

(a) {1} 

(b) (0, 1] 

(c) (1/n}ey 

(d) @ 

(e) Z 

(f) R 

(g) Q 


The proof of the next exercise is quick. It is useful because it gives us an 
equivalent way to think about open sets. 


EXERCISE 5.4.2 A set A is open if and only if Int(A) = A. 
EXERCISE 5.4.3. If L is the LUB ofa set A, then L € Bdy(A). 


Cluster Points 


Once again, let A be a subset of the real numbers, and let x be any real number. It 
might be that elements of A are packed densely around x. 


Definition 5.4.4 A real number x is said to be a cluster point of A, provided 
every €-neighborhood of x contains a point in A other than x itself. That is, for all 
€ > O, there exists a € AN N(x), where a F x. 


A cluster point of A might or might not be an element of A. 
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EXERCISE 5.4.5 Show that zero is a cluster point of the following sets. 


(a) (1/n}my 
(b) [0,1] 


The definition of cluster point motivates a new term for convenience. 


Definition 5.4.6 For a real number x and « > 0, the set N¢(x) — {x} is called 
the deleted «-neighborhood of x and is denoted DN,(x). Another way to write 
DN.(x) is fy € R: 0 < |y —x| < €}. 


Thus we may say that x is a cluster point of A if every deleted e-neighborhood 
of x contains an element of A; that is, for every « > 0, AN DN¢<(x) is non- 
empty. Cluster points have other common names, for example, limit points or 
accumulation points. 


EXERCISE 5.4.7 Suppose L is the LUB ofaset A and L ¢ A. Then L is a cluster 
point of A. 


EXERCISE 5.4.8 What does it mean for x not to be a cluster point of A? 


If x € A, but x is not a cluster point of A, we say that x is an isolated point of A. 
Notice in the proof of the next theorem how we must replace an arbitrarily cho- 
sen € > 0 with €2 > 0, which might be smaller. The desired point that we find in 
the €2-neighborhood will therefore also be in the e-neighborhood. 


Theorem 5.4.9 If A is an open set, then every element of A is a cluster point. 
Proof. Suppose A is open and pick any x € A. To show x is a cluster point of A, we 


must first choose € > 0. Since A is open, there exists €; > 0 such that Ne, (x) C A. 
Let e2 = min{e, €,}. Then the point x + €2/2 € AN DN, (x). 


EXERCISE 5.4.10 Show that the converse of Theorem 5.4.9 is not true. 


Sometimes one statement might seem stronger than another but is actually 
equivalent. In the next exercise, the fact that every deleted neighborhood contains 
some point of A is actually strong enough to allow you to show that every deleted 
neighborhood contains infinitely many points of A. 


EXERCISE 5.4.11 If x is a cluster point of A, then every «-neighborhood of x 
contains infinitely many elements of A.!° 


16 ‘Try a proof by contrapositive. 
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The next exercise gives us a way to think about a closed set without having to 
refer to its complement. 


EXERCISE 5.4.12 A set is closed if and only if it contains all its cluster points. 


EXERCISE 5.4.13 Show that {1/n}°_, is not closed. 
/ n=l 


Closure of Sets 


If a set A is not closed, we might want to close it off, so to speak, by finding the 
smallest closed superset of A, if there is one. First we define a term for this smallest 
closed superset of A. Then we address whether it exists, and if so, whether it is 
unique. 


Definition 5.5.1 Suppose A is a set of real numbers, and suppose C is a set with 
the following properties. 


(C1) ACC. 
(C2) C is closed. 
(C3) If Disaclosed set and A C D,thenC C D. 


Then C is called a closure of A, and is denoted A. 


Notice how Definition 5.5.1 lays down the characteristics that the set C must 
have in order for it to qualify as a smallest closed superset of A. Property Cl 
guarantees that any set we would call A is indeed a superset of A. Property C2 
guarantees that it is closed, and property C3 says that no closed superset of A will 
be any smaller. 

Let A be a set, and consider the family of subsets of the real numbers, where 
each set in the family is a closed superset of A. That this family is non-empty is 
immediate, for R itself is a closed superset of A. In the next exercise, you will show 
that the closure of a set exists uniquely. What you will show is that the intersection 
of all closed supersets of A has properties C1—C3 and that any two sets that both 
have properties C1—C3 are actually the same. 


EXERCISE 5.5.2 Suppose A is a set of real numbers. Then A exists uniquely 
and can be constructed as 


7m eee (5.11) 
SDA 
S closed 


17 See Exercises 3.3.14 and 5.3.11 and Theorem 3.3.11. 
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The construction of A in Exercise 5.5.2 is a way of closing off a set A by 
intersecting down from above in a sense, through closed sets containing A, as 
if we were shrink wrapping A. Another way to imagine the closure of a set is in 
terms of its interior and boundary. 

Recall that for a given set, every real number is in precisely one of the interior, 
exterior, or boundary of the set. Also, no element of the set is in the exterior. 
Exercise 5.5.4 says that the closure of a set can also be thought of as the set itself, 
some points of which might be interior points and some boundary points, with 
any missing boundary points tossed in. A hint will reveal one way to approach the 
proof. First, you will need the following. 


EXERCISE 5.5.3 What does it mean to say x ¢ A? 
EXERCISE 5.5.4 If A is aset of real numbers, then A = Int(A) U Bdy(A).!8 


EXERCISE 5.5.5 Use either Exercise 5.5.2 or Exercise 5.5.4 to determine the 
closure of the following sets. 


(a) Y 

(b) R 

(c) (a,b) 

(d) (a,b) U @,0) 

(e) (1/n}ry 

(f) Q 

EXERCISE 5.5.6 Give an example of two disjoint sets with the same closure. 


EXERCISE 5.5.7 If AC B,thenA Cc B.!” 


EXERCISE 5.5.8 A is closed if and only if A = A. 


Corollary 5.5.9 A=A. 


Proof. Since A is closed, then by Exercise 5.5.8, A=. 


EXERCISE 5.5.10 One of the following statements is true, and for the other, 
one direction of subset inclusion is true. Prove the three true subset inclusion 
statements and provide a counterexample to illustrate that the fourth is false. 


18 Show (A)© = Ext(A). 
19 What is the relationship of B to A? 
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(a) AUB=AUB” 
(b) ANB=ANB 


Compactness 


A special class of set deserves our attention now. Sets that are compact have a 
fundamental feature that makes them very important if they serve as the domain 
of certain types of functions. Before we can define compactness, we need another 
term first, motivated by a little story. 

My greatgrandmother was known to make quilts. The standard way of mak- 
ing them was to collect hundreds of small pieces of cloth and then stitch them 
together into an interesting pattern to form a single entity—the quilt. Now sup- 
pose you are lying on the bed, and your grandmother takes a finished quilt and 
tosses it over you as if she were making up the bed. What would it mean to say 
that she has covered you? 

You can answer this question in two ways. One is to say that she has covered 
you provided every point on your body lies under some point of the quilt. That 
is, for every point of your body, there exists a point of the quilt that lies on top 
of the chosen point of your body. Another way to answer the question is similar, 
but expressed in terms of the individual pieces of cloth that were used in making 
the quilt. 


EXERCISE 5.6.1 Use the universal and existential quantifiers to construct an 
alternate statement equivalent to your body being covered by the quilt. 


Now suppose that the pieces of cloth used in the quilt are subsets of the real 
numbers, and think of the collection of these pieces as a family of sets. Stitching 
the pieces together to form the quilt is analogous to forming the union across the 
family of sets, forming a single set. Now imagine that your body is also a subset of 
the real numbers. Translating the quilt story into this set language, together with 
our first explanation of what it means to be covered by the quilt, motivates part 
of the following definition. 


Definition 5.6.2 Suppose A is a set of real numbers and C is a family of sets of 
real numbers. If 


Aes (5.12) 
SeC 


we say that C covers A, or is a cover of A. If every set in C is open, we call C an 
open cover. If F is a subfamily of C that also covers A, we call F a subcover of C. 


20 Take a hint from the hint for Exercise 5.5.4. 
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EXERCISE 5.6.3 Translate your answer to Exercise 5.6.1 into appropriate 
mathematical language to state the definition of cover in Definition 5.6.2 in a 
slightly different way. 


Example 5.6.4 LetC = {Nj (x) : x € Q}, the set of neighborhoods of radius 1 of 
all rational numbers. Then C is an open cover of R because every real number is 
within 1 of some rational. Also, {N1(7) : n € Z} is a subcover of C because every 
real number is within 1 of some integer. 


EXERCISE 5.6.5 What does it mean for C not to be a cover of A? 


Covers and subcovers are of most interest to us when the cover C is an open 
cover with infinitely many sets in it, and F is a subcover that has only finitely many 
sets from C in it. In this case, we say that C can be reduced to a finite subcover. 
In the next exercises, part of what you will show is that an open cover of a set A 
cannot be reduced to a finite subcover. The natural way to do this is to show that 
any finite subfamily fails to cover A. These exercises will be very helpful when you 
prove part of the main result of this section. 


EXERCISE 5.6.6 Letabe any real number. Then the family 
C = {(—00, a—1/n) U (a+ 1/n, +00) :n € N} (5.13) 


covers R — {a} but cannot be reduced to a finite subcover. 


EXERCISE 5.6.7 The family of intervals C = {(—n, n) : n € N} covers R but has 
no finite subcover. 


Now we are ready for the definition of compactness. 


Definition 5.6.8 A set A is said to be compact if every open cover of A can be 
reduced to a finite subcover. 


To show a given set is not compact, all we have to do is demonstrate an open 
cover that cannot be reduced to a finite subcover, as you did in Exercises 5.6.6 
and 5.6.7. It might, therefore, be pretty easy to prove that entire classes of sets are 
not compact in this way. However, to demonstrate that a given set is compact from 
Definition 5.6.8 would appear to be quite a task, for something rather powerful 
must be shown about every conceivable open cover of the set. Thankfully, someone 
has been down this road before and supplied us with a relatively painless way to 
determine precisely which sets are compact. 


Theorem 5.6.9 (Heine-Borel). A set is compact if and only if it is closed and 
bounded. 
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We will piece together the proof of Theorem 5.6.9 in stages by way of several 
exercises. The next two exercises comprise the => direction of the proof. 


EXERCISE 5.6.10 Suppose A is not closed. Demonstrate an open cover of A 
that has no finite subcover.”! 


EXERCISE 5.6.11 Suppose A is not bounded. Demonstrate an open cover of A 
that has no finite subcover.”* 


To prove the < direction of Theorem 5.6.9, the following two theorems lay 
most of the groundwork. We will prove the first one here, for it is quite intricate. 
We leave the second as an exercise. 


Theorem 5.6.12 The interval [a, b] is compact. 


Proof. Suppose C is an open cover of [a,b]. We construct a set E, which is 
by definition a subset of (a, b], in the following way. For x € (a, b], let x be 
in E provided some finite subfamily of C covers the interval [a,x]. First we 
show that E is non-empty and bounded from above. Then we show that its 
LUB is b. 

The fact that E is bounded from above is easy, for it is a subset of (a, b] by 
definition, thus bounded from above by b. To show that E is non-empty, we 
note that since C covers [a, b], then there is some S, that contains a. Since S; 
is open, there exists €; > 0 such that Ne, (a) C S. Thus the subfamily {51} covers 
[a, a+ €1/2],so that a+ 1/2 € E. Thus E is not empty. 

Since E is a non-empty set bounded above by J, it has LUB L < b. First we 
show that L € E. Then we show that L = bin order to have that the entire interval 
[a, b] can be covered by a finite subfamily of C. 

Since C is an open cover of [a, b] and L < b, there exists S$) € Csuchthat L € 5S). 
Since Sz is open, there exists €2 > 0 such that N¢,(L) € So. Since L is the LUB 
of E, there exists some x € (L — €2, L] NM E. Thus [a, x] can be covered by a finite 
subfamily F2 C C. But then F2 U {59} is a finite subfamily of C that covers [a, L], 
so that L € E. 

Now suppose L < b. Since S2 is open, there exists €3 > Osuch that N.,(L) © Sp. 
Let y = min{L + €3/2, b}. Then y > L, y € So, and y € [a, b]. So the interval [a, y] 
can be covered by a finite subcover of C. This contradicts the fact that L is the 
LUB of E. Thus L < bis impossible, and L = b. 


Knowing that [a, b] is compact moves us very close to showing that any closed 
and bounded set is compact. The following exercise brings us within epsilon of 
being finished. 


21 Use the characterization of closed sets from Exercise 5.4.12 and Exercise 5.6.6. 
22 Use Exercise 5.6.7. 
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EXERCISE 5.6.13 IfA C B, where A is closed and B is compact, then A is also 
compact.?? 


Now we have all the pieces we need to prove Theorem 5.6.9. 


Proof of Theorem 5.6.9. 


(=) If A is not closed, then Exercise 5.6.10 implies that A is not compact. If A is 
not bounded, then Exercise 5.6.11 implies that A is not compact. 


(<) Suppose A is closed and bounded. Then there exists M > 0 such that 
A C [—M, M]. By Theorem 5.6.12, [—M, M] is compact. Since A is a closed 
subset of a compact set, A is compact by Exercise 5.6.13. 


You might be a little disappointed that we went to all the trouble to define a 
new term (compactness) in complicated terms of open covers and subcovers, when 
it turns out that compact sets are precisely those that are closed and bounded, two 
ideas we already have a handle on. Well, you would be right to say there is no 
necessity to define a new term if the class of things to which the term applies can 
be easily described in already familiar language. But the point remains that closed 
and bounded sets have a very important feature: that every open cover can be 
reduced to a finite subcover. We need this feature in Section 7.7. 


23 Tf C is an open cover of A, then the fact that AC is open should suggest a way to expand C into an open 
cover of B by the addition of one more set. 
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6.1 


Sequences of Real Numbers 


Sequences Defined 


We can approach the subject of sequences in several ways. Let’s move from an 
informal to a more formal way. A sequence of real numbers is an ordered list 


(1, 42, 43, ..+5An,--.) (6.1) 


Listing the terms of a sequence in order might bring back memories of showing a 


set A is countable, for this listing is equivalent to constructing a function f : N —— 


A, where f(1) = a1, f(2) = a2, and so on. A sequence can therefore be defined as 

a function from the positive integers into the reals. The notation (a,)°°, is one 

standard way of denoting a sequence. Beginning the list with a; is a matter of 

convenience. There might be times when it seems more natural to begin with apo. 
If there is a formula for a,, say, for example, a, = 1/n, we have 

(2)0) = (1,5, 55+) (6.2) 


n/n=1 


We must distinguish (6.2) from {1/n}"° ,, which is merely the set of elements of 
the sequence (1/n) and does not have the ordering of the elements as one of 
its defining characteristics. Saying a, = 1/n makes the idea of a sequence as a 
function more natural, for we are talking about nothing other than f:N— R 
defined by f(x) = 1/n, and the set {1/n} can then be thought of as the range of 
the sequence. Then, if we want to visualize the sequence graphically, we can, as in 
Figure 6.1. Such a graph will help in visualizing limits in Section 6.2. Example 6.1.1 
presents some examples of sequences we will refer to later. 


Example 6.1.1 


1. The sequence (a,)~, defined by a,= 5 +(-1)". 5 is the sequence 
(0,1,0,1,0,1,...). 

2. Letting a, =n + (—1)" forn > 0 generates (1,0, 3,2,5,4,7,6,...). 

3. Ifa, = sinnz, then we generate the sequence (0). 
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1- e 


1/2 e 


1/3 5 e 


1/4 4 ° 
1/5 5 e 


1/n 5 e 


The sequence ay = f(n) = 1/n. 


4. We can define a, in cases. For n > 0, let 


0, if n is even 
aA, = 
" \n2—1 ifnisodd 


to generate the sequence (0, 0, 0, 8, 0, 24, 0, 48, 0, 80, 0, .. 


5. Ifa, =n*/2” forn > 1, then we generate the sequence 


1 9 25 36 49 64 
(G.1. 3.1 a) 


Monotone Sequences 


(6.3) 


oy 


An important class of sequences derives from the following definition. 


Definition 6.1.2 A sequence of real numbers (a,)°°, is said to be increasing 
(or nondecreasing) provided a, < a, whenever m < n. It is said to be decreasing 
(or nonincreasing) provided ay, > a, whenever m <n. If (a,) is either increasing 
or decreasing, then it is said to be monotone. If am < a, whenever m <n, then 
(an) is said to be strictly increasing. If am > an whenever m <n, then (a,) is said 


to be strictly decreasing. 


If a and b are positive real numbers such that a < b, then Exercise 2.2.8 implies 
that 1/a > 1/b. It follows that if (a,) is an increasing (or decreasing) sequence of 
positive real numbers, then (1/ay) is a decreasing (or increasing) sequence of all 
positive numbers. The result is similar for sequences of negative real numbers. 


This suggests one way to approach the next example. 
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Example 6.1.3 Show that the sequence defined by a, = n/(n + 1) for n > 1 is 
strictly increasing. 


Solution Weshow thatthe sequence (1/a,) is strictly decreasing. Suppose m < n. 
Then 1/m > 1/n, so that 1/am =1+1/m >1+1/n =1/a,. Thus the sequence 
(1/ay) is strictly decreasing. a 


Given a positive real number x, the sequence (x”) proves to be important in a 
number of different contexts. 


EXERCISE 6.1.4 Letxbea positive real number. Then the sequence (x")"° | is 
strictly increasing if x > 1 and strictly decreasing if 0 < x < 1.! 


EXERCISE 6.1.5 What does it mean to say that a sequence is not increasing? 


EXERCISE 6.1.6 Give an example of a sequence that is both increasing and 
decreasing. 


EXERCISE 6.1.7 Determine whether the following sequences are increasing, 
decreasing, or neither. Prove your claims. 

(a) (1-1 n 

(b) (1/n? 

(c) (+ er 


Bounded Sequences 


Boundedness of sequences is defined in precisely the same way we defined 
boundedness of sets. 


Definition 6.1.8 A sequence of real numbers (a,) is said to be bounded from 
above provided there exists a real number M, such that a, < M, for all n. 
Similarly, the sequence is said to be bounded from below provided there exists 
areal number M2 such that a, > M2 for all n. The sequence is said to be bounded 
provided there exists M > 0 such that |a,| < M for all n. If a sequence is not 
bounded, it is said to be unbounded. 


The boundedness terms in Definition 6.1.8 read very much like those in 
Definition 5.1.1. In fact, to say that a sequence is bounded from above, bounded 
from below, or bounded is precisely the same as saying that the range of the 
sequence is bounded from above, bounded from below, or bounded. Furthermore, 
we may apply Exercise 5.1.2 to the range of a sequence to have the following. 


! See Exercise 3.5.7(b). 
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Theorem 6.1.9 A sequence is bounded if and only if it is bounded from above 
and below. 


Example 6.1.10 From Example 6.1.1, we note without details that 


1. (5 +(-1)"- 5) is bounded above by M; = 8 and below by Mz =0. It is 
therefore bounded. 


2. (n+ (-1)") is not bounded from above; therefore it is not bounded. It is, 
however, bounded from below by M2 = 0. 


3. The sequence (a,) defined in Eq. (6.3) is not bounded from above, and thus 
is not bounded. It is bounded from below by M2 = —84. 


Example 6.1.11 We show that the sequence from Example 6.1.1 defined by 
an = n? /2" is bounded. In Exercise 3.5.16, you showed that 2” > n2 for alln > 5. 
Thus 0 < n? /2" <1 for all n > 5. Let M = max{ay, ao, a3, ag, 1}. We show that 
0 <a, < M for alln. 

Let n be a positive integer. If 1 < n < 4, then clearly 


0 < ay < max{a1, a2, a3, a4} < M (6.4) 


Ifn > 5, then 0 < a, <1 < M. Ineither case, 0 < a, < M,so that the sequence is 
bounded. 


EXERCISE 6.1.12 What does it mean for a sequence to be unbounded? 
Example 6.1.13 Show that the sequence defined by a, = ./n is unbounded. 


Solution Pick M > 0, and let n be any integer satisfying n > M*. By Exer- 
cise 3.9.4, we have that a, = ./n > VM? = M>0,so that |a,|>M. 


Since the sequence (x”) is decreasing for 0 < x < 1,it is bounded above by 1. 
Since every x” is positive, (x”) is bounded below by zero. Thus if 0 < x <1, 
the sequence (x”) is bounded. However, the next example shows that (x”) is 
unbounded if x > 1. 


Example 6.1.14 Suppose x > 1. Then (x”)°° , is unbounded. 

Solution Suppose (x”) is bounded, and let L be the LUB of the sequence. Let 
€ = x —1, which is positive. Then by property M2, there exists a positive inte- 
ger n such that L —€ <x" < L, so that L < x"+x-—1. But since x” > 1, we 
have that 


tt 8 tl _ yh a(x —1) tx" > x-1ltx" SL (6.5) 


This is a contradiction, so that (x") is unbounded. 
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EXERCISE 6.1.15 Consider the sequence (a,) defined by 


10, if n is odd 


1—n?, ifniseven 


Prove that (a,) is bounded from above but not from below. 


To close out this introductory section on sequences, here is an example that 
illustrates a technique we will use in Section 6.2. The exercises that follow will be 
similar. 


Example 6.1.16 Consider the sequence defined by a, = (n+ 3)/(n —1) for 
n> 2. 


(a) Find a positive integer N such that |ay — 1| < 0.001. 
(b) Does the inequality in part (a) hold for alln > N? 


Solution 


(a) Expanding the inequality |ay —1| < 0.001 according to Theorem 2.3.4 
and then solving for N yield the following equivalent inequality state- 


ments. 
—0.001 < ea —1<0.001 
N—-— 
4 
—0.001 < i < 0.001 


—0.001(N — 1) < 4 < 0.001(N — 1) 


N > —3999 and N> 4001 


The inequality N > 4001 is stronger than N > —3999, so we may let N = 4002. 
(b) Suppose n > N. Then 


4 
HL 0S——] 2 
eS ene, ST 


< 0.001 (6.6) 


so that ja, — 1| < 0.001 for alln > N. a 


EXERCISE 6.1.17 Let (an) be defined as in Example 6.1.16, and let e« > 0 be 
given. Find a value of N (which will be in terms of €) such that |ay — 1| < «. Show 
that ja, —1| < eforalln > N. 
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EXERCISE 6.1.18 Let (an) be defined by ay = (n + 3)/(2n — 1), and suppose 
€ > 0 is given. Find a value of N such that |ay —1/2| <«, and show that 
lan — 1/2| < €foralln > N. 


EXERCISE 6.1.19 Let (ay) be defined by a, =n/(n+ 1),and suppose0 <¢€ <1 
is given. Find a value of N such that |ay| > €, and show that |a,| > ¢ foralln > N. 


1. et (a,) be define An = 1/(n* +1), and suppose € > 0 is 
EXERCISE 6.1.20 L (an) be defined by 1/(n* + 1), and supp Oi 
given. Find a value of N such that |ay| < €, and show that |a,| < € for alln > N. 


EXERCISE 6.1.21 Suppose ¢ > 0, and (a,) and (b,) are two sequences with 
the following properties. If n > Ni, then |a, — Li| < €/2, and if n > N2, then 
|b, — L2| < €/2. Given this information, you should be able to make a true 
statement by filling in the blanks below. 

Ifn> ? ,then|(a,+5,) —(L1 + L2)| < __?_. (6.7) 


’ 


Fill in the blanks with the appropriate numbers, then prove that the resulting 
statement is true. 


Convergence of Sequences 


One of the most important questions we can ask about a sequence of real numbers 
is whether it converges. If a sequence converges to a real number, its terms are 
“homing in on” or “approaching” some real number as n takes on larger and 
larger values. We can also talk about a sequence converging to positive or negative 
infinity. In this section, we define these notions and prove some important basic 
results about convergent sequences. 

It would be misleading not to say up front that the idea of convergence cre- 
ated no small amount of controversy and distress in the historical development of 
mathematics. It took quite some time for a satisfactory definition to be devised. 
Sir Isaac Newton, in developing the calculus, assumed some pretty sweeping 
results concerning convergence in his work, only to have its details placed on 
a firm footing later. Because the idea is a little complicated, we will work our way 
into it a little at a time. 


Convergence to a Real Number 


Figure 6.2 illustrates convergence to a real number L. Given some «€ > 0, the 
e-neighborhood of L contains all the terms in the sequence beyond some threshold 
term ay. In order to visualize how you might prove that the terms of a sequence 
(dn) are converging to L, imagine the following task. Someone gives you ane > 0. 


2 You will have to treat the cases ¢ > 1 and € < 1 separately. 


Figure 6.2 
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A convergent sequence. 


You then take that € value and do a quick calculation to determine some positive 
integer N (which will depend on €) with the property that the Nth term and all 
those after it fall in the e-neighborhood of L. If someone else then gives you an 
even smaller € > 0, you can still find a threshold value N with the same property, 
though it will probably be a higher threshold. If you are given any € > 0 and are 
able to find some positive integer N with the property that all terms from the Nth 
one onward fall in the e-neighborhood of L, then you will have shown that the 
sequence converges to L. 

Before we give the definition of convergence, note that you have already done 
some of this sort of work in the exercises of Section 6.1. In Exercise 6.1.17, you 
showed that 


n+3 
—1 
1 


4 
Ifn > 1+ -,then <€. 
€ 


n— 
That is, all terms with an index larger than 1 + 4/e fall in the e-neighborhood of 1. 
In Exercise 6.1.18, you showed that 


n+3 1 
2n—-1 2 


1 7 
If ~+ —,th 
jee, wale en 


That is, all terms with an index larger than 1/2 + 7/4e fall in the e-neighborhood of 
1/2. Note that in both of these exercises, smaller values of € yield higher thresholds 
for n. Now we are ready for the definition of convergence. 


Definition 6.2.1 Suppose (an)P°, is a sequence of real numbers. We say that 
the sequence converges if there exists a real number L such that, for every € > 0, 
there exists a positive integer N with the property that n > N implies |a, — L| < e. 
That is, the sequence converges provided 


(AL ER)\VWe > O)GN EN)\Wn EN) = N = lan — LI < ©€) (6.8) 
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If (an)°° , converges to L, we say that L is the /imit of the sequence as n approaches 
infinity, and we write this as 


im. a, = L (6.9) 
We also say that a, approaches L as n approaches infinity, and we write this as 


a, > Lasn—> oo (6.10) 


If a sequence does not converge, we say it diverges. 


To construct a proof of convergence is, as always, to follow the logical flow of 
the definition. A general form for such a proof would look something like this. 


Theorem 6.2.2 (Sample). The sequence (a,)°° , converges to L. 


Proof. Let « > 0 be given. Let N be any integer satisfying N > f(€).° Then if 
n > N,we have that 


lan — L| = f~\(n) < f-'(N) <e (6.11) 


Thus a, ~ Lasn > o. 


Here is a cleaned up example. 
Example 6.2.3 The sequence ((n + 1)/(4n + 3)) converges to 1/4. 


Solution Let « > 0 be given. Let N be any positive integer satisfying N > 
1/16¢ — 3/4. Then if n > N, we have 


1 
16N + 12 


n+1 1 7 1 
4n+3 4| |16n+12 


| < | <€ (6.12) 


Thus (1 + 1)/(4n + 3) — 1/4asn > oo. | 


If you wonder what inspired the statement N > 1/16 — 3/4, it is the result 
of working backward from the last step of the inequality in (6.12) and solving 
for N to produce a logically equivalent inequality. This scratchwork is what makes 
inequality (6.12) work. 

Notice that the logical flow of Theorem 6.2.2 and Example 6.2.3 does not 
exploit the expression (AL € R) given in Definition 6.2.1 because the supposed 
limit is provided in the statement of the theorem. This is typical of convergence 
proofs of this sort, because your theorem will almost certainly come with the limit 
as part of the package. 


3 Where you have already done the scratchwork to find what f(c) should be. 
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EXERCISE 6.2.4 Write polished proofs that the following sequences from 
Section 6.1 converge to the given limit. 


(a) (n+3)/(n — 1) > 1 (Exercise 6.1.17) 
(b) 1/(n? + 1) > 0 (Exercise 6.1.20) 
EXERCISE 6.2.5 By stripping off the (AL € R) in the definition of convergence, 


state what it means for a sequence (a,) not to converge to a given real number L 
as n — 00. 


Specific examples of convergence are great, but we need generalized theorems 
addressing convergence in order to develop a broader theory of convergence. The 


following results go a long way by giving us some basic building blocks and ways 
of combining them. 


EXERCISE 6.2.6 The limit of a sequence of real numbers is unique.* 


Theorem 6.2.7 Let c be areal number. Then the constant sequence defined by 
an = c converges to c. That is, lim, 4c = c. 


Proof. Let « > 0 be given. Let N = 1. Then ifn > N, we have that 


la, — L| =|c—c| =0<e 


EXERCISE 6.2.8 The sequence 1/n > Oasn > oo. 
EXERCISE 6.2.9 Show that 1/2” > 0asn > oo. 

The proof of the next theorem reveals a useful trick in showing how a sequence 
converges, so take note. We exploit the convergence of (a,,) by applying its defining 
feature to «/M. 

Theorem 6.2.10 Suppose a, — Oand (b,) is bounded. Then a,b, > Oasn —> oo. 
Proof. Let « > 0 be given. Since (b,) is bounded, there exists M > 0 such that 
|bn| < M for all n. Now €/M is also positive. Since a, — 0, there exists a positive 


integer N such that |a,| < «/M for alln > N. Thus if n > N, we have 


€ 
lanbn — O| = |anbal = |an| lbn| < Vi “M=e (6.13) 


Thus a,b, > Oasn > oo. 


4 if L, and L2 are both limits and L; < L2, then letting « = (L7 — L1)/2 should yield a contradiction. 
5 Use the Archimedean property. 
® See Exercise 3.5.16. 
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Convergence is stronger than boundedness, but in one part of upcoming 
Exercise 6.2.12, all you need to know about a convergent sequence is that it is 
bounded. To prove the result in the next exercise, consider the following sugges- 
tion. If (a,) converges to L, its terms eventually settle down close to L, so that all 
the terms beyond a certain point are caught in the interval (L — 1,2 +1). Sono 
matter how wildly the sequence might jump around before this point, there are 
only finitely many terms to contain. This should suggest a value of M to serve as a 
bound. 


EXERCISE 6.2.11 A convergent sequence is bounded.’ 


EXERCISE 6.2.12 Suppose a, — L, and b, > Lz asn > oo. Then 
(a) limp—oo(an + bn) = Li +L? 
(b) limp c0(Gnbp) = E1710" 


Corollary 6.2.13 Suppose lim,_, 0a, = L and let c be any real number. Then 
Timp oo(Can) = cL. 


Proof. Let b, = c for all n in Exercise 6.2.12(b) and apply Theorem 6.2.7. 


Corollary 6.2.14 If a, — Ly and b, > Lz as noo, then (a, — bn) > 
(Li — L2) asn > ow. 


Proof. 
lim (dn — bn) = lim [an + (—1)bn] = lim ay + lim (—1)bp, 
n—->oco noo noo n->oo 


(6.14) 
y+ (-)L2=L;-L2 


With an induction argument calling on Exercises 6.2.8 and 6.2.12(b), you can 
show the following. 


EXERCISE 6.2.15 If p is any positive integer, then 1/n? > 0 asn > oo. 


Look at the reasoning we used in Example 6.1.11 where we took care of finitely many terms individually 
and then handled all the rest with a single number. A good start for this proof would be to say: “Let 
€ = 1. Then there exists ....” 

Let M = max{laj|, |az|,..., |ay—1], [Z| + 1}. 

Remember, if « > 0, then so is €/2. See Exercise 6.1.21. 

First deal with the case L; = 0 by appealing to previous results. For L; 4 0, do some preliminary 
playing around with the expression |a,by, — L1L2|. The sneaky trick here is adding and subtract- 
ing the same quantity inside |a)b, — L1L2|. If you want to know what that quantity is, check the 
next hint. 

Add and subtract by, L; and use the triangle inequality. Also, apply Exercise 6.2.11 to (bn). 
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To combine two sequences (a,) and (b,) by division to produce (a,/bn), we 
first need to do a little work on the sequence (1/b,) by itself. We would like to be 
able to say something to the effect that the limit of a quotient is the quotient of 
the limits. Well, we can, sort of. 

The sequence (1/n) contains only positive terms. However, 1/n — 0,so every 
e-neighborhood of zero contains some 1/n. Thus the terms of the sequence (1/n) 
cannot be bounded away from zero. But if the terms of a sequence are all nonzero 
and converge to a nonzero limit, then we can bound its terms away from zero. 


EXERCISE 6.2.16 Suppose (b,) is a sequence of nonzero real numbers such 
that b, — L, where L is nonzero. Then there exists M > 0 such that |b,| > M for 
alln.2-3 


Exercise 6.2.16 provides the right machinery for you to prove the following. 


EXERCISE 6.2.17 Suppose (b,) is a sequence of nonzero real numbers that 
converges to a nonzero limit L. Then limy_, 95 1/b, = 1/L. 


Corollary 6.2.18 Suppose (ay) is a sequence that converges to Lj, and (bn) is 
a sequence of nonzero real numbers that converges to a nonzero limit L2. Then 
an/by > L1/L2 asn > oo. 


Proof. Apply Exercise 6.2.12(b) to (a,) and (1/b,). 


Many of our results from Theorem 6.2.7 through Corollary 6.2.18 can take us 
a long way in dealing with a particular class of sequences. We will prove one case 
of the next theorem and leave the other case to you as an exercise. 


Theorem 6.2.19 Suppose (a,) is a sequence defined by 


_ byn" + by-nl | +--+ + bin + bo 
sn + e5-yn-! 4. +. Fen +c 


(6.15) 


where all by and cz are real numbers, b, and c, are nonzero, and the denominator 
is nonzero for all n. If r < s, then a, ~ 0 asn > co. Ifr=s, then a, > b,/cs as 
n— oo. 


Proof of case r<s. Suppose r < s. Multiply the expression for a, in Eq. (6.15) by 
n~*/n~* to have 


byn"S + by_n’ 1-8 Abate byni-s + bons 


6.16 
Cs + Cy_jn7! + sae + cyni-s + cons ( ) 


ayn = 


12 First deal with L > 0. Let e = L/2. Then there exists a positive integer N such that .... 
Bipy< 0, then use the fact that —b, > —L. 
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Since r < s, every exponent of n in Eq. (6.16) is negative, and we may apply the 
results of this section to have 


; lim(b,-n’—* +--+ + bon~*) 
lim a, = - 
n—>0o lim(c,; +--+ + con) 


_ lim(@,n"™) +--+ + lim(bon™) 
~ dimes +--+ + lim(con7) 


(6.17) 
_ (lim b,)(dimn’*) +--+ + dim bo) (dim n~™) 
~ limc, +--+ + (limco) (lim n-‘) 


_ by OF +by-0 0 
ee ee 


EXERCISE 6.2.20 Prove the case of Theorem 6.2.19 where r = s. 


EXERCISE 6.2.21 Suppose a, — L; andb, > L2 asn — oo. Suppose also that 
dn < by for all n. Show that L1 < L2.!4 


EXERCISE 6.2.22 Show thatifa, > Lasn > cv, then |a,| > |Ll asn > o.)5 


EXERCISE 6.2.23 The Sandwich Theorem: If Qn < Cn < by for all n, and if 
a, > Land b, > L,thenc, —> L. 


Convergence to Infinity 


Sometimes a sequence does not converge to a real number, but it does exhibit a 
nice behavior in that it increases (or decreases) without bound. The term we use 
for such behavior is that a, — -+oo (or —0o). We have already used the somewhat 
loose expression n — oo in Definition 6.2.1, even though infinity is not a real 
number. It is not difficult to define the notion of limy_..5 a, = oo, but there is 
no way to do it in terms of neighborhoods, unless you can concoct some sort of 
definition of a neighborhood of infinity. We will do exactly that in Section 7.4, 
where we will spend some quality time with infinity. For now, we will be content 
to work with a few basic ideas. 

Ifa, > Lasn —> oo, the terms might hop all around L, just as long as that hop- 
ping stays within an e-radius of L beyond some threshold term. That this bouncing 
around can be eventually contained for all « > 0 is why the bouncing apparently 
fades out. To say a, — +00 asn — oo means loosely that the terms a, are hopping 


14 Try proof by contrapositive. 
15 See Exercise 2.3.10. 
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increasingly higher and higher, even though there is no reason the terms cannot 
do at least a little hopping back down. Instead of selecting an arbitrary ¢-radius 
around some L, we set an arbitrary hurdle bar at some positive number M and 
insist that all terms beyond some threshold are above the hurdle. If every M is 
associated with some threshold beyond which a, > M, then we say a, — +00 as 
n — oo. Here is the definition. 


Definition 6.2.24 Suppose (a,) is a sequence of real numbers. We say that 
limn—+oco Gn = +00, provided for all M > 0, there exists a positive integer N with 
the property that n > N implies a, > M. Logically, we may write this as 


(VM > O0)GN EN)(Vn EN)(n > N > ay > M) (6.18) 


We also write a, — ++0o as n > oo, and we sometimes say that the sequence 
increases without bound. 


Example 6.2.25 Show that lim,_... ./n = +00. 


Solution Pick M > 0 and let N be any positive integer satisfying N > M*. Then 
ifn > N,we have that /n>/N>J/M2=M. @ 


EXERCISE 6.2.26 If (an) is increasing and unbounded, then a, — +00 as 


n> ©. 
An immediate consequence of Exercise 6.2.26 is that if x > 1, then x” + +00 


as n — oo. For Exercise 6.1.4 showed that (x”) is increasing for x > 1, and in 
Example 6.1.14 we showed it is unbounded. 


EXERCISE 6.2.27 Suppose (a,) is a sequence of nonzero real numbers such 
that a, > +00 asn — oo. Then 1/a, > 0asn > oo. 


EXERCISE 6.2.28 If0 <x <1,thenx" > 0asn > o. 

EXERCISE 6.2.29 If—1 <x <0,then x” > Oasn > oo. 

EXERCISE 6.2.30 In the spirit of Definition 6.2.24, create a definition for the 
statement limy-.o dn = —O. 

The Nested Interval Property 

Imagine a sequence of closed intervals ([dn, bn])°°,, where [an, bn] S (am, bm] 


whenever m <n. Such a sequence of intervals is said to be nested because 
any particular interval contains all those after it. In this section, we derive the 
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Nested Interval Property (NIP) of the real numbers, which says that a nested 
sequence of closed intervals whose lengths (b, — a,) tend to zero as n > oo will 
intersect down to a single real number. We will then investigate an important 
implication of the NIP as it relates to bounded sequences. We also demonstrate 
that the NIP can be used axiomatically to prove the LUB property, thus making 
the two statements logically equivalent. 


From LUB Axiom to NIP 


by < bm Whenever m <n. In particular, a1 < an < by < bi for all n. Thus the 
sequence of left end points (a,) is monotone increasing and bounded above 
by b;, and the sequence of right end points (b,) is monotone decreasing and 
bounded below by a1. As we begin to make our way to the NIP, we need the 
following. 


If (Lan, bn]) is a nested sequence of closed intervals, then clearly am < ay, < 


EXERCISE 6.3.1 Suppose (a,,) is an increasing sequence of real numbers that is 
bounded from above. Then (a,) converges to its LUB. 


Exercise 6.3.1 is the step in the process of proving the NIP where you will use 
the LUB axiom. The next result should follow quickly from Exercise 6.3.1 by the 
strategic insertion of some negative signs. 


EXERCISE 6.3.2 Suppose (b,) is a decreasing sequence of real numbers that is 
bounded from below. Then (b,) converge to its GLB. 


If the hypothesis conditions for Exercise 6.3.1 are satisfied, we say that (a,) 
converges to L from below, and we write a, 7 L. Similarly for Exercise 6.3.2, we 
say (b,) converges to L from above, and we write by, \, L. 

Thus we have that a nested sequence of closed intervals ([an, b,]) has the 
property that a, > L; and b, — Lz for some real numbers L; and L2. Now if 
([an, bn]) also has the property that the lengths of the intervals approach zero 
as n — oo, then we have limy-,.0(bn — dn) = 0. Because (ay) and (by) converge 
individually, then by Corollary 6.2.14, 


0= lim (by — an) = lim b, — lim ay = Lo — Ly (6.19) 
n—-oo n—>oo n—->oo 


Thus L; = L2. The last piece of the NIP puzzle asserts that this limit is the single 
element in the intersection of all the nested intervals in the sequence. Here is 
a statement of the theorem and its proof, with its last detail left to you as an 
exercise. 


Theorem 6.3.3 (Nested Interval Property). Let ([a,, b,]) be a sequence of 
nested intervals with the property that limy—oo(bn — dn) = 0. Then N° j[an, bn] 
contains precisely one real number. 
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Proof. By Exercises 6.3.1 and 6.3.2, both (a,) and (b,) converge, and by Corol- 
lary 6.2.14, they have the same limit L. We show that N°, [ay, b,] = {L}.... 


n=1 


EXERCISE 6.3.4 Finish the proof of Theorem 6.3.3. 


The NIP Applied to Subsequences 


In Section 6.4, we will need some results based on taking a given sequence and 
creating from it a new sequence by perhaps deleting some of its terms and pre- 
serving the order of the kept terms. The new sequence is called a subsequence, and 
we take the opportunity now to introduce the term and note what the NIP has to 
say about certain sequences and subsequences. 

Creating a subsequence of a given sequence can make for a little notational 
mess when it comes to indexing the subsequence, so let’s look at an example. 
Given a sequence (a,), suppose we want to consider the following subsequence. 


(a2, dg, 410, 440, 444, 466; « - -) (6.20) 
The notational dilemma can be resolved by noting that the indices 
(2, 8, 10, 40, 44, 66, ...) (6.21) 


form a strictly increasing sequence (nx), of positive integers. If we denote 


ny = 2, n2 = 8, n3 = 10, and so on, then the sequence (m1, 2,n3,...) Is a way 
of indexing our subsequence. The subsequence can then be denoted by 


(An, ’ Qn2> an3> ang are ) = (On) po (6.22) 


where k is now the indexing variable. This brings us to a definition. 


Definition 6.3.5 Suppose (an)°°, is a sequence of real numbers, and suppose 
(nx)e-, is a strictly increasing sequence of positive integers. Then the sequence 
(Any) p21 18 called a subsequence of the sequence (an). 


Suppose you love to play darts. You play every day from now to eternity, 
and you’re pretty good at it. At least you’re good enough so that you always hit 
somewhere on the target. If every dart you throw leaves a tiny hole behind, what 
must eventually happen to the set of pinholes on the target as time goes on? Well, 
it is certainly possible that your darts could always land on one of a finite number 
of spots on the target, in which case you would hit at least one spot infinitely many 
times. But if you do not hit any one spot infinitely many times, then your target 
will have infinitely many holes in it. If so, then the finite size of the target will 
imply that the pinholes are going to cluster in one or more places. Of course, if 
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we think of two direct hits on the same exact spot as leaving two distinguishable 
holes (why not?), then it can indisputably be said of your target that the holes are 
somewhere clustered and around at least one spot. 

This little analogy is going somewhere. Suppose (a,) is a sequence of real 
numbers that is bounded by M > 0. Then the interval [—M, M] is the dart board 
target, and the terms of the sequence are the spots you hit. The next theorem says 
that there are an infinitely many terms of the sequence clustered around some 
point in [—M, M], but it does so in the language of subsequences. It will come in 
really handy in Exercise 6.3.7, where you prove the famous Bolzano-Weierstrass 
Theorem. 


Theorem 6.3.6 Every bounded sequence of real numbers contains a conver- 
gent subsequence. 


Proof. Suppose (a,) is bounded by M, and denote [co, do] = [—M, M]. (See 
Fig. 6.3.) Let mg be the midpoint of [co, do], and note that either [co, mo] or [mo, do] 
must contain infinitely many of the a,. (Perhaps both do.) Let [c1, d;] be one of 
the subintervals of [co, do] that contains infinitely many of the a,, pick any term 
of the sequence from [c1, di], and denote it as a,,. Repeat this process by letting 
m1 be the midpoint of [c1, di], letting [c2, d2] be whichever of [c1, m1] or [m1, di] 
contains infinitely many of the a,, and then picking any term of the sequence from 
[c2, d2] whose index exceeds nj, and call this term a,,. Continuing in this fashion, 
we generate two things: a nested sequence of intervals [cz, d,] whose lengths are 
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Figure 6.3 Nested intervals generated in the proof of Theorem 6.3.6. 
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2M/2* and therefore tend to zero (Exercise 6.2.9), and a subsequence (an,)24, 
each term of which falls in [c,, d,]. 

By the NIP, N72, Lex, dk] = {xo} for some real number xg. Since cy — xo and 
dy — Xo as k — oo, and since cx < an, < dx for all k, the Sandwich Theorem 
(Exercise 6.2.23) implies that an, > xo. 


EXERCISE 6.3.7 Prove the Bolzano-Weierstrass Theorem: Every bounded, 
infinite set of real numbers has a cluster point.!® 


From NIP to LUB Axiom 


Let’s go back and note how the LUB axiom has contributed to the results we have 
developed thus far. We said in Section 3.10 that assumption A22 can be derived 
from the LUB axiom, though we have not proved this is true. All the results of 
Sections 3.9 and 3.10 are based on A22, and hence depend on the LUB axiom as 
well. Certainly, a lot of our work in Chapter 5 relied on the LUB axiom. Section 5.1 
was all about the LUB axiom. Theorem 5.3.14 used the LUB axiom to show that 
the empty set and the real numbers are the only subsets of the real numbers that 
are both open and closed. In Section 6.1, the only places we used any results from 
the LUB axiom were in choosing the maximum value of a finite set in proving 
some results about boundedness. Consequently, the theorems from Section 6.2 
involving boundedness stem from the LUB axiom. 

We now want to delete the LUB axiom from our list of assumptions and replace 
it with the NIP. We can then show as a theorem that the set of real numbers 
has the LUB property. To do this, we retain our terminology of sequences and 
convergence. Assuming the NIP, the next exercise is the result we want to prove. 
If you want more elaboration on the task and some suggestions for how to tackle 
it, read the comments that follow. 


EXERCISE 6.3.8 Every non-empty set of real numbers that is bounded from 
above has a least upper bound in the real numbers.!7 


Your task is to use the NIP to prove the LUB property as a theorem. That is, 
somehow you are going to use what you know about the set to generate a sequence 
of nested intervals whose lengths tend to zero, so that you can apply the NIP to 
this sequence. If you want more guidance, read on for a thumbnail sketch of the 
proof. 

Suppose A is a non-empty set bounded above by M. Since A is non-empty, we 
can choose some x € A. We can then split the interval [x, M] in half repeatedly. At 


You can apply Theorem 6.3.6 by creating just the right sequence from the set, or you can mimic the 
proof of Theorem 6.3.6 in its use of the NIP. 
Prove that the single element of N[an, by] has properties L1-L2 by contradiction. 
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Figure 6.4 Nested intervals generated in the proof of Exercise 6.3.8. 


6.4 


each stage, keep the right half of the split interval if it contains any elements of A; 
otherwise, keep the left half. (See Figure 6.4.) The successive splits will produce 
a sequence of nested intervals whose lengths tend to zero. The NIP then gives us 
a single real number, the intersection of all the intervals, which you can show has 
properties L1-L2 (or M1 and M2 if you prefer). 


Cauchy Sequences 


Now we turn our attention to a certain class of sequences called Cauchy sequences. 
If we can say that convergent sequences are characterized by the fact that 
the terms all eventually bunch up around some real number (the limit), then 
we say that Cauchy sequences are characterized by the fact that the terms all 
eventually bunch up around each other, without any specific reference to a 
possible real number limit. This bunching of terms is logically equivalent to 
convergence to a real number limit (with the help of the NIP), and this is the 
main result we want to prove in this section. Once we work through these 
results, we will discuss why Cauchy sequences are important in more general 
mathematical settings, and we relate their convergence to the LUB axiom and 
the NIP. 
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Figure 6.5 A Cauchy sequence. 
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Convergence of Cauchy Sequences 


First let’s define what we mean for a sequence to be Cauchy. Then we will show 
that a sequence is convergent if and only if it is Cauchy. 


Definition 6.4.1 A sequence of real numbers (a;,) is said to be Cauchy provided 
that, for all « > 0, there exists a positive integer N such that |a,, — a,| < € for all 
m,n>N. 


One way to envision a Cauchy sequence in terms of some given € > 0 and 
the corresponding threshold N is that all terms of index at least N fall within 
an ¢€ distance, not only of ay, but of each other as well. See Figure 6.5. 
There is no apparent limit around which the neighborhood is anchored, so 
the question of whether a Cauchy sequence must converge is not immediately 
answerable. In showing that convergence and Cauchy are equivalent, let’s do 
the easy part first. The next exercise says merely that if the terms of a sequence 
bunch up around a real number limit, then they must bunch up around each 
other. 


EXERCISE 6.4.2 A convergent sequence is Cauchy. 
EXERCISE 6.4.3 What does it mean for a sequence (a,) not to be Cauchy? 


If a sequence is not Cauchy, then by Exercise 6.4.2 it does not converge. Some- 
times a natural way to show that a particular sequence diverges is to show it is not 
Cauchy. 


EXERCISE 6.4.4 Show that the sequence ((—1)") is not Cauchy and hence 
diverges. 


To prove that a Cauchy sequence is convergent, we need the following lemma. 
To prove it, mimic your proof of Exercise 6.2.11. Only now, instead of fencing in 
the tail of the sequence around a known limit, bound it around the threshold term 
ayn by using € = 1. 
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EXERCISE 6.4.5 A Cauchy sequence is bounded. 


At this point, you have all the machinery you need to show that a Cauchy 
sequence is convergent. Here is a statement of the theorem, with some suggestions 
for the proof following. 


EXERCISE 6.4.6 A Cauchy sequence of real numbers is convergent. 


If a sequence is Cauchy, then by Exercise 6.4.5 it is bounded. Thus it con- 
tains a convergent subsequence by Theorem 6.3.6. This gives you a real number 
limit to work with, and you can show that the entire sequence converges to 
this same limit by using convergence of the subsequence and Cauchiness of the 
sequence. 

The real numbers are only one example of a set where we address the con- 
vergence of Cauchy sequences. In any set endowed with a metric (a measure of 
distance between elements), Cauchy sequences can be defined in the same way as 
in Definition 6.4.1 by interpreting |a,, — a,| as the distance between elements in 
that context. Any such metric space, as we call it, where Cauchy sequences always 
converge to an element in the space is said to be complete. 

In some situations, completeness might be taken as an axiom of the metric 
space. What we have done here is to show completeness of the real numbers as 
a theorem, based on the NIP, hence based on the LUB axiom. Some authors 
of texts in analysis prefer to go the other way by assuming completeness as an 
axiom of the real numbers and demonstrating the NIP and LUB property as 
theorems. Of course, this means that completeness is logically equivalent to both 
the LUB property and the NIP, and we will show that completeness of the real 
numbers implies the NIP. By Exercise 6.3.8, completeness also implies the LUB 
property. 

One good reason an author might want to take completeness as an axiom 
is that it makes for a pretty neat way to fill in the spaces between the rational 
numbers with the irrationals and make the real number line into the smooth 
continuum we envision. It also is a natural way to arrive at a form of closure in the 
steps we take to build the real numbers from the set {0, 1}. In a nutshell, it goes 
like this. 

In the way we described in Section 3.8, we begin with the set {0,1} and 
expand it to the whole numbers in order to achieve closure of addition. We 
then expand the whole numbers to the integers to achieve closure of additive 
inverses, and then we extend the integers to the rationals to have closure of 
multiplicative inverses for nonzero elements. Now envision the rationals as a 
set of points on the number line with all the irrationals missing, and consider 
the set of all possible sequences of rational numbers. A problem with the ratio- 
nals is that many of the Cauchy sequences will not converge to a rational limit. 
There are a lot of unproved assumptions in the next example, but it illustrates the 
point. 


6.4.2 
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Example 6.4.7 The terms 


aj=1 
a=1.4 
a2 = 1.41 
a3 = 1.414 


ay, = 1.41421356237 


represent the start of a sequence that converges to V2. Every term in the sequence 
is rational, for it is a terminating decimal. The sequence is Cauchy, for, given e > 0, 
let N be a positive integer such that 10-" <e. Then if m,n > N, |dm —an| < € 
because a,, and a, will agree through the Nth decimal place. 


So the rational numbers are not a complete set, which means that they do not 
contain all possible limits of all Cauchy sequences. What are you going to do? If 
you want the rational numbers to be embedded in a larger set that is complete, you 
are going to have to throw in all possible limits of Cauchy sequences of rational 
numbers. With a little bit of work, you can fill in the holes in the number line with 
these irrational limits. 


From Completeness to the NIP 


If we can show that completeness of the real numbers implies the NIP, then we 
will have demonstrated the logical equivalence of the LUB property, the NIP, and 
completeness. Here are some reminders and a suggestion about how to prove the 
final exercise of this section. 

In proving the NIP from the LUB property in Section 6.3, we supposed 
([an, bn]) is a nested sequence of closed intervals whose lengths tend to zero as 
n — oo. That the length of the intervals goes to zero did not by itself say anything 
about convergence of (a,) or (b,) separately. The intervals might get arbitrarily 
short, but if they are not nested, then nothing prevents them from waltzing forever 
up the real number line, so that neither (a,) nor (b,) converges. That the intervals 
are nested meant that (a,) and (b,) are bounded. We then applied monotonicity 
and the LUB property to the a, and b,, which put a real number limit into our 
hands. This limit was just the number we needed to arrive at the NIP. 

This time, in proving the NIP from completeness, we still begin with the assump- 
tion that the intervals are nested and that their lengths tend to zero. The difference 


206 


Chapter 6 Sequences of Real Numbers 


this time is that the axiom we apply is completeness. If we can show that (a,) and 
(b,) are Cauchy, then the assumption of completeness will give us a real number 
limit for each. This turns out to be fruitful, for the fact that (b, — a,) > 0 makes 
|by — a,| eventually very small. To conclude that (a,) is Cauchy, b, in the expres- 
sion |b, — a,| can be replaced by a, for m > n to have |am — a,|, which is even 
smaller than |b, — a,|. The rest of the work in proving the NIP would be identical 
to what you did in Section 6.3, where we observed lim a, = lim b,, and where you 
showed in Exercise 6.3.4 that N[a,, b,] = {L}, where L is the common limit of (a,) 
and (b,). So here is the only result we need to bridge the gap from completeness 
to the NIP. 


EXERCISE 6.4.8 Suppose ([a,, b,]) is a sequence of nested intervals such that 
(by — An) > 0 asn > ov. Then (a,) is Cauchy. (A similar argument would show 
that (b,) is Cauchy.) 


7.1 


Functions of a Real Variable 


Functions whose domain is a subset of the real numbers are called functions of a 
real variable, and functions whose range is a subset of the real numbers are called 
real-valued. Some especially powerful and interesting results can be deduced if 
the domain of such a function is compact. Because this chapter deals exclusively 
with real-valued functions of a real variable, we will write f : S > R, where it is 
understood that S is a subset of the real numbers. 


Bounded and Monotone Functions 


First we discuss boundedness and monotonicity of a function f : S > R. With 
these and other ideas from Sections 7.2—-7.4, we will point out many links to 
properties of sequences we discussed in Chapter 6. 


Bounded Functions 


The definitions of the boundedness terms for functions resemble those for 
sequences. 


Definition 7.1.1 Suppose f : S > Risa function and A is a subset of S. We say 
f is bounded from above on A provided there exists a real number M; such that 
F(x) < M, for all x € A. Similarly, we say f is bounded from below on A provided 
there exists a real number Mp) such that f(x) > M2 for all x € A. If there exists 
M > 0 such that | f(x)| < M for all x € A, we say f is bounded on A. For each of 
these characteristics, if it applies on the entire domain, we say simply that f is 
bounded from above, bounded from below, or bounded. 


EXERCISE 7.1.2 Use the logic of Definition 7.1.1 to state what the following 
terms mean. 


(a) f isnot bounded from above on A. 
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F(x) = 1/x. 


(b) f isnot bounded from below on A. 

(c) f isnot bounded on A. 

EXERCISE 7.1.3 For f(x) = 1/x on the interval (0, co) show the following. (See 
Fig. 7.1.) 

(a) f is bounded on (1, ov). 

(b) f is bounded from below on (0, 1). 


(c) f isnot bounded from above on (0, 1). 


Monotone Functions 


When we defined a sequence to be monotone increasing, the fact that the terms 
of the sequence are lined up in an order made the definition easy to come by—if 
we pick two indices m <n, it must be that a», < d,. A similar definition works for 
functions. 


Definition 7.1.4 Suppose f:S — R is a function, A is a subset of S, and 
a,beé A. Then f is said to be increasing (or nondecreasing) on A provided 
f(a) < f(b) whenever a < b. Similarly, f is said to be decreasing (or nonincreas- 
ing) on A provided f(a) => f(b) whenever a < b. If f(a) < f(b) whenever a < b, f 
is said to be strictly increasing on A. If f(a) > f(b) whenever a < b, f is said to 
be strictly decreasing on A. For each of these characteristics, if it applies on the 
entire domain, we say simply that f is increasing, decreasing, strictly increasing, 
or strictly decreasing, and we group functions of these types into the class we call 
monotone functions. 
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EXERCISE 7.1.5 Show f(x) = 1/(? + 1) is strictly decreasing on [0, oo). 


In Exercise 3.5.7, you proved the following theorem, even though we did not 
have the language of monotonicity at the time. 


Theorem 7.1.6 Let x be any positive integer. Then: 


1. f(x) = x*"-1 is strictly increasing on the set of real numbers. 


2. f(x) =x?" is strictly decreasing on (—oo, 0] and strictly increasing on [0, 00). 


In your pre-calculus work, you probably learned about the horizontal line 
test, an intuitive way of determining whether a function f is one-to-one. If every 
horizontal line in the xy-plane crosses the graph of f no more than once, then f is 
one-to-one. If f does not map onto the real numbers, we can crop the codomain of 
f down to Rng f so that f : S > Rng f is onto. Thus if f : S — Rng f is one-to- 
one, then f~! : Rng f — Swill exist. To find the expression for f~!, you switched 
the roles of x and y in the expression for f, then solved for y. You also learned 
that the graph of f~! could be sketched by reflecting the graph of f about the 
diagonal line y = x. This reflection is the visual result of swapping x and y. Think 
intuitively for a moment. If f is a strictly increasing function, then does it pass 
the horizontal line test? If so, then f~! exists on Rng f. Given this, can you say 
anything about the monotonicity of f~!? 


Theorem 7.1.7 If a function is strictly monotone on A, then it is one-to-one 
on A. 


EXERCISE 7.1.8 Prove Theorem 7.1.7 for the case of an increasing function. 
(A similar argument would work for a decreasing function.) 


As an immediate consequence of Exercise 7.1.8 and Theorem 4.4.7, we have 
the following. 


Corollary 7.1.9 If f is strictly monotone on A C S, then there exists an inverse 
function f—!: f(A) > A. 


Letting A = Sin Corollary 7.1.9, we have that if f : S > Risstrictly monotone, 
then f-!: Rng f > S exists. Furthermore, the following is true. 


Theorem 7.1.10 If f: S > Ris strictly increasing (or strictly decreasing), then 
f—!: Rng(f) > S is strictly increasing (or strictly decreasing). 


EXERCISE 7.1.11 Prove the case of Theorem 7.1.10 where f is increasing. 
(A similar argument would work if f is decreasing.) 


Theorems 7.1.6 and 7.1.10 imply that %/x is strictly increasing on [0, oo) and 
2n-V/x is strictly increasing on R. Also, with the next exercise, the function f(x) = x" 
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is strictly increasing on [0, oo) for any rational number r. For we may write r = m/n 
for positive integers m and n,so that f(x) = Vx". 


EXERCISE 7.1.12 Suppose f and g are real-valued functions such that go f is 
defined on some set S. If f and g are both increasing, then so is go f. 


EXERCISE 7.1.13 State and prove other results analogous to Exercise 7.1.12 
by varying the hypothesis conditions on f and g to include all combinations of 
increasing and decreasing. 


Limits and Their Basic Properties 


Suppose f is a function that is defined on some deleted neighborhood of a. 
Whether f(a) exists is beside the point in the definition of limit, so we insist on no 
more than a deleted neighborhood. In a way somewhat similar to our definition 
of convergence of sequences as n — ov, we discuss lim,-., f(x), the limit of f(x) 
as x approaches a. In this section, we will work our way into the definition of the 
limit of a function, and then look at some basic theorems involving limits that bear 
striking parallels to those theorems involving sequences. 


Definition of Limit 


Let’s work our way into a definition of lim,_., f(x) gradually. Let f(x) be defined 
in the following way: 


4x—1, ifx4#2 
x= 71 
he 50, ifx=2 me) 


Defining f in this way produces a very familiar straight line, but with a hole 
punched in the graph at x =2 and the value of f(2) artificially (but purpose- 
fully) defined to be a number way out of line with what you would expect it to 
be. (See Fig. 7.2.) Even though f(2) = 50, the function does not take on values 
anywhere near 50 if x is close to but different from 2. To the contrary, if x is 
in some small deleted neighborhood of 2, f(x) appears to take on values close 
to 7. The next example asks the question that will lead us into the definition of 
limit. 


Example 7.2.1 Let f be defined as in Eq. (7.1), and suppose ¢€ > 0 is given. 
For the e-neighborhood of 7 on the y-axis in Figure 7.2, we want to find a radius 
5 > 0 of a deleted neighborhood of 2 on the x-axis so that every x in the deleted 
neighborhood maps into N,(7) on the y-axis. That is, we want to find 5 > 0 so that 
| f(x) — 7| < « whenever 0 < |x—a| <6. 


Figure 7.2 


7.2 Limits and Their Basic Properties 211 


40 + 


30 5 


f defined from Eq. (7.1). 


Solution Since the only values of x we care about are different from 2, we may 
use f(x) = 4x — 1 in the inequality | f(x) — 7| < «. Thus we have the following 
equivalent inequalities. 


7—€<4x-1<T7+e 


2-f<x<24+ 


€ 
= Oe 
eATaG 


By letting 6 = €/4 we guarantee that if 0 < |x —2| < 6,then |f(x)-7| <«. 


Example 7.2.1 illustrates the following point, which is at the heart of the 
definition of lim,_,, f(x). Given a “target” of radius « around y = 7, we are able to 
find a deleted neighborhood around x = 2 such that all values of the function 
in the deleted neighborhood of x = 2 hit somewhere in the target neighbor- 
hood of y = 7. If you imagine a sequence of smaller and smaller «-values, it is 
always possible to find sufficiently small deleted neighborhoods of x = 2 that map 
entirely within the smaller and smaller targets. If such a deleted 5-neighborhood 
can be found for all e-neighborhoods of y = 7, then we say that lim,_,2 f(x) = 7. 
Here is the definition. 


Definition 7.2.2 Suppose f: S— R is a function defined on some deleted 
neighborhood of a, and L is a real number. Then we say lim,_,, f(x) = L provided 
for all « > 0, there exists 5 > 0 such that, for all x € S, 0 < |x —al < 6 implies 
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| f(x) — L| < e. If lim, f(x) = L, we say f converges to L as x > a. Another 
way to write this is f(x) ~ Lasx— a. 


To write Definition 7.2.2 symbolically, we would have 


lim f(x) = L & (We > 0)(5 > O)(¥x € S)(0 < |x —a| <8 > |flx) - LI <6) 
(7.2) 


Here is an example of a proof based on Definition 7.2.2. Some scratchwork 
similar to the inequalities in Example 7.2.1 has been omitted. 


Example 7.2.3 Let g be defined by 


_ 3x? + 17x +20 


g(x) = pas (7.3) 


Note that the natural domain of g is all real numbers except x = —4. Show 
lim,-,—4 g(x) = —7/2. 


Solution Let « > 0 be given, and let 6 = 2e/3. Now if x 4 —4, a factor of (x + 4) 
may be canceled from the numerator and denominator of g. Thusif0 < |x + 4| < 6, 


we have 
Pygetles 3x? + 17x+20 7 
ae 2x+8 2 
| Bx+5)\a+4) i 
2(x + 4) 2 (7.4) 
(3x45 7) [3x4+12 
i on i) 2B 
3(x+4)| 3 35 


Thus lim,_,_4 g(x) = —7/2. 1 


In the same way that f from Example 7.2.1 seemed a little contrived because 
of the special effort we took to define f(2) = 50, g from Example 7.2.3 might seem 
contrived because nothing would seem to prevent us from canceling the (x + 4) 
factor from numerator and denominator. Granted, g is merely the straight line 
y = 3x +5 with a hole punched in the domain at x = —4 by the introduction of 
an (x + 4) factor in the numerator and denominator. But f and g are good first 
examples to illustrate that the limit of a function such as x > a has nothing to do 
with the value or even the existence of the function at x = a. Whether the function 
exists at a and is the same as the limit as x > a is called continuity, and we’ll look 
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at that in Section 7.5. There are plenty of examples in which a hole in the domain 
occurs naturally, and we will look at one example later in this section. 


Basic Theorems of Limits 


There is a real similarity between the definition of convergence of a sequence as 
n — oo and convergence of a function as x — a. In the convergence of sequences, 
a given € > 0 must be associated with a threshold term beyond which all terms of 
the sequence fall in the e-neighborhood of the limit. In the convergence of a func- 
tion f,a given « > 0 must be associated with a 5-radius of a deleted neighborhood 
of a, within which all values of the function, except perhaps f(a), must fall in the 
e-neighborhood of the limit. The good news is that the similarity of these def- 
initions makes for plenty of similar theorems, with similar proofs to match. So 
here is a barrage of theorems involving convergence of a function. As we present 
them, we will often link them back to sequence theorems and properties. We prove 
some of them here to illustrate the similarities, but you will prove most of them 
as exercises by mimicking your work from Section 6.2. 


Theorem 7.2.4 For any real numbers a and c, the constant function f(x) = c 
satisfies lim,g f(x) = c. 


Proof. Let « > 0 be given, and let 6=1. Then if 0 < |x—a| < 6, |f(x) —cl = 
lc —c| = 0 < e. Thus lim,-_,g f(x) =c. 


EXERCISE 7.2.5 For any real number a, limy.,.x = a. 


Theorem 7.2.6 If f(x) > Lasx —> a, then there exists a deleted neighborhood 
of a where f is bounded. 


Proof. Suppose f(x) > L as x >a. Then there exists 6 >0 such that 0 < 


|x —a| <6 implies that | f(x) -—L| <1. Let M=1+|L|. Then M>0 and if 
0 < |x —a| < 6, we have 


[fl =|f@) -L+L| s |f@)—-L)+|L| <1+|Ll|=M 


EXERCISE 7.2.7. If f(x) ~ 0 as x > a, and if g is bounded on a deleted 
neighborhood of a, then f(x)g(x) > Oas x > a. 

EXERCISE 7.2.8 If f(x) > Ly, and g(x) > L2 asx > a, then asx > a: 

(a) fot+e@ > Lli+L2 

(b) fg) > LiL2 
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Three results follow as immediate corollaries of Theorem 7.2.4 and 
Exercise 7.2.8. 


Corollary 7.2.9 If f(x) > Lasx — a,and if cis any real number, then cf(x) > 
cLasx— a. 


Corollary 7.2.10 If f(x) > Li and g(x) > L2 as x > a, then f(x) — g(x) > 
Li —Ly2asx— a. 


Corollary 7.2.11 If P(x) = apx” + an_1x""! +---+ a,x + ao, then for any real 
number a, P(x) > P(a) asx > a. 


We state the next result in a way slightly different from the corresponding 
result for sequences. 


EXERCISE 7.2.12 Suppose g(x) > L as x > a, where L is nonzero. Then there 
exists a deleted neighborhood of a where g can be bounded away from zero. 
In particular, if L > 0, there exists M > 0 and 6 > 0 such that 0 < |x—a| <6 
implies g(x) > M.If L < 0, there exists M < 0 and 6 > O such that 0 < |x —a| <6 
implies g(x) < M. 


EXERCISE 7.2.13 Suppose g(x) > L as x > a, where L is nonzero. Then 
1/g(x) — 1/Lasx— a. 


Corollary 7.2.14 Suppose f(x) > Li and g(x) > L2 #0 as xa. Then 
S(x)/g(x) > L1/L2 asx > a. 


Theorem 7.2.15 Suppose f is defined by 


Pu(x) — apx’ + (ee sae Sere eee 
Po(x)  Bsx8 + Bs x1 +--+ byx + bo 


fx) = 


Then, provided P2(a) 4 0, f(x) > Pi(a)/P2(a) asx > a. 


Proof. Apply Corollaries 7.2.11 and 7.2.14, and the result follows. 


If f(x) > Land g(x) > Lasx > a, then f(x) — g(x) > 0. Thus if € > 0, there 
exists 6 > 0 such that 0 < |x —a| < 6 implies | f(x) — g(x)| < €/2. This fact will 
come in handy in proving the following. 


EXERCISE 7.2.16 [Sandwich Theorem] Suppose f, g, and / are functions 
with the property that g(x) < h(x) < f(x) for all x in some deleted neighborhood 
of a. Suppose also that f(x) > Land g(x) > Lasx > a.Thenh(x) > Lasx > a. 


Earlier we promised an example of a function with a natural hole in the 
domain where we want to address the limit. The function is f(x) = sinx/x at 
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a = 0. Except for a passing glance at sine in Section 6.1, we have never mentioned 
any trigonometric, exponential, or logarithmic functions. There is a reason for this: 
Definitions of these functions that are rooted in the axioms of the real numbers 
are not possible to come by at this stage of our game. Only the algebraic func- 
tions arise from the theory we have developed thus far, and they can make for 
some pretty sticky proofs by themselves. Strict definitions of these nonalgebraic 
functions, called transcendental functions, come later. However, if we kick back 
for a while and give ourselves the freedom to talk about sine, cosine, and tan- 
gent in the familiar language of the unit circle, then we can apply Exercise 7.2.16 
to show 


in =a (7.5) 


x>0 XxX 


In trigonometry, we take a real x-number line and “wrap” it around the unit 
circle in the uv-plane with x = 0 placed at (u, v) = (1,0) and the positive half 
of the x-axis wrapped counterclockwise. (See Fig. 7.3.) For a real number x, we 
define cosx and sinx to be the u and v coordinates, respectively, of the point 
where x falls in the wv-plane. If 0 < x < 2/2, Figure 7.3 shows how we may view 
x as an arc length, and sin x, cos x, and tan x as the lengths of segments in Quad- 
rant I. Similar triangles in the figure will convince you that the length labeled tan x 
is correct. If 0 < x < 2/2 as is suggested in the sketch, the geometry of the unit 
circle reveals 


sinx <x < tanx (7.6) 


Figure 7.3 Basic trigonometric functions. 
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If —2/2 < x < Oso that x is in Quadrant IV, then sin x and tan x are also negative, 
and we have 


tanx <x <sinx (7.7) 


If we take both parts of the inequality in (7.6) separately and solve each for sin x/x, 
we can put them back together into the single inequality 


cos x < sd <1 (7.8) 
Xx 


If we do the same thing for (7.7), remembering that x < 0 and sinx < 0, we also 
arrive at (7.8), so that (7.8) holds for all 0 < |x| < 2/2. Let’s accept from the 
geometry of the unit circle and the definition of cosx that cosx > 1 as x > 0. 
Then, by Exercise 7.2.16, Eq. (7.5) holds. 


EXERCISE 7.2.17. Assume that —1 <cosx <1 for all x, and show that 
lim,_,9 xcosx = 0. 


Let’s remind ourselves of some of the theory of Cauchy sequences in order to 
motivate our last theorem from this section. An immediate result of convergence 
of a sequence is that it is Cauchy (Exercise 6.4.2). If the terms of a sequence get 
close to some limit, then they must get close to each other. Similarly, we can make 
the following claim, which says that if f(x) converges as x — a, then the values of 
f must not vary much from each other in small deleted neighborhoods of a. 


EXERCISE 7.2.18 Suppose f(x) —~ L as x > a. Then for every € > 0, there 
exists 6 > 0 such that x1, x2 € DNs(a) implies | f(x1) — f(x2)| < €. 


If the conclusion statement in Exercise 7.2.18 does not hold, then there exists 
€ > Osuch that for all 6 > 0, there will exist x1, x2 € DNs(a) with | f(x1) — f(x2)| = 
e. That is, there is some € > 0 such that every deleted 5-neighborhood of a contains 
two points whose functional values are at least € apart. If a function f has this 
property, then lim,_,, f(x) does not exist. 


Example 7.2.19 Let f be defined in the following way: 


1, ifx is rational 
f(x) = 


7.9 
0, if x is irrational 7) 
If ais any real number, then lim,_,, f(x) does not exist. For we may let e = 1/2 and 
choose any 6 > 0. Then by Exercises 5.2.4 and 5.2.5, the deleted 5-neighborhood of 
acontains a rational number x; and an irrational number x2, and | f(x1) — f(x2)| = 
1 > e. This function is sometimes called the salt and pepper function because its 
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values of 0 and 1 are sprinkled up and down the domain like grains of salt and 
pepper. No one knows for sure which is the salt and which is the pepper. 


EXERCISE 7.2.20 The signum function, sgn x, is defined in the following way: 


-1, ifx<0 
senx=4 0, ifx=0 (7.10) 
1, ifx>0 


Show lim,_,9 sgn x does not exist. 


EXERCISE 7.2.21 Use familiar values of sin x to show that lim,_,9 sin(1/x) does 
not exist. 


More on Limits 
One-Sided Limits 


In defining lim,_., f(x) = L, we insisted f be defined on both sides of a and all 
points to the nearby left and right of a map into N,(L). If f is only defined on one 
side of a, or perhaps if values of f to the immediate left of a behave differently 
from those to the immediate right (as in sgn x), we can discuss the limit of f(x) as 
x approaches a from the left or from the right separately. Instead of using entire 
deleted neighborhoods of a, we use only the left or right half of them. 


Definition 7.3.1 Suppose f is defined on the interval (a,b). Then we say 
lim,_,,+ f(x) = L (read “as x approaches a from the right”) if for all « > 0, there 
exists 6 > 0 such that a < x < a+ implies | f(x) — L| < €. We call L the right- 
hand limit of f at a. Similarly, if f is defined on the interval (c,a), we say 
lim,.,- f(x) = L (x approaches a from the left) if for all « > 0, there exists 
5 > O such that a — 6 < x < aimplies | f(x) — L| < €. We call L the left-hand limit 
of f ata. 


EXERCISE 7.3.2 Given a function f limy—+a f(x) = Lif and only if 
lim, f@) = lim f@=L (7.11) 


Example 7.3.3 Let f be defined in the following way: 


2 . 
x-—1, ifx<2 
a me 


(7.12) 
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By Theorem 7.2.15, limy_.2(x2 — 1) = 3andlim,_,2 1/(x — 1) = 1 (inthe two-sided 
sense). Thus, by the > direction of Exercise 7.3.2, lim,_,>- (x2 — 1) =3 and 
lim,_,9+ 1/(x — 1) = 1. But then lim,_,5- f(x) = 3 and lim,_,.+ f(x) = 1, so that 
lim,+2 f(x) fails to exist by the < direction of Exercise 7.3.2. 


EXERCISE 7.3.4 Let f be defined in the following way: 


2senx, ifx<1 


FORD | a4 tes 4 


(7.13) 


Evaluate the following, with reference to applicable results. 
(a) limyo- f@) 

(b) limy.o+ f@) 

(c) limy-o f(@) 

(d) lim,.1- f@) 

(e) limy+1+ f@) 

(f) limy1 f() 


EXERCISE 7.3.5 Show lim,_,o+ /x = 0. 


Sequential Limits 


Suppose f is a function defined on a deleted neighborhood of a, where f(x) > L 
as x > a. Let (a,) be any sequence of real numbers that converges to a, but 
where every a, is different from a. Now consider the sequence (f(a,)). (See 
Fig. 7.4.) It seems intuitively clear that f(a,) > L as n — ov, and this is a fairly 
straightforward fact to demonstrate. 


f(a) 


Figure 7.4 Convergence of a function in terms of sequences of domain elements. 
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What might be surprising is that the converse of this result is also true. If every 
sequence of real numbers converging to a (where all the a, are different from 
a) produces a sequence of functional values that converges to L, then f(x) > L 
as x > a. One way to prove this is by contrapositive. Suppose f(x) does not 
converge to L as x — a, claim the « that this guarantees, and then use a sequence 
of progressively smaller and smaller 5-values to create a sequence (a,) where 
all a, are different from a and that converges to a, but where f(a,) does not 
converge to L. 


EXERCISE 7.3.6 Suppose f : S > Risa function. Then lim,_,q f(x) = Lif and 
only if every sequence (a,) such that a, — a and a, # a, for all n also has the 
property that f(an) > L. 


The logical equivalence of our €-6 definition of limit and the sequential limit 
characteristic in Exercise 7.3.6 means that it is possible to define the statement 
limy+a f(x) = L using either an €-é definition or a sequential limit definition. Some 
authors prefer to define function limits by defining lim,_,, f(x) = L provided every 
sequence (a,) where every a, # a anda, — aasn — oo implies that f(a,) > L. 
Our e-6 feature would then be a theorem. They then construct proofs of theorems 
such as those from Section 7.2 from the theory of sequences we discussed in 
Section 6.2. If you already have the theory of sequences under your belt, function 
limit proofs become very easy. For example, let’s return to Exercise 7.2.8(a): If 
f(x) > Ly and g(x) > Lz asx > a, then f(x) + g(x) > L, + L2. A proof using a 
sequential limit definition and exploiting the theorems about sequences would go 
something like this. 


Proof of Exercise 7.2.8(a). Suppose f(x) > L1 and g(x) > Lz asx — a. Let (ay) 
be a sequence such that a, > a and a, #a for all n. Then since f(x) > Lj as 
x — a, we have that f(a,) > Li asn > oo. Similarly, since g(x) > L2 as x > a, 
then g(a,) > L2 asn —> oo. By Exercise 6.2.12, f(an) + g(an) > Li + L2. Since 
(an) was chosen arbitrarily, we have f(x) + g(x) > Li+L2asx—- a. 


Perhaps you feel a little cheated at this point. After all, if only we had proved 
Exercise 7.3.6 at the beginning of Section 7.2, then all our proofs about limits 
would have been one-liners. There is some truth to that. However, e€-5 proofs 
pervade mathematics, and the fact that your first €-6 proofs could be done easily 
by merely mimicking work from Section 6.2 was probably a humane way for you 
to be introduced to them. 


EXERCISE 7.3.7 Use Exercise 7.3.6 to show that lim,_,9 sin(1/x) does not exist 
by demonstrating an appropriate sequence. 


Limits Involving Infinity 


Now we want to extend the concepts and language of limits to include the symbols 
+oo, even though infinity is not a real number. We have used the symbo! for infinity 
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before in our work with sequences when we defined limy_, 50 an. However, our use 
of the symbol was merely a formal one, because convergence was defined solely 
in terms of the indexing set of positive integers. In this section, we want to take 
the expression lim,_,, f(x) = L and replace either a or L (or both) with one of the 
symbols too. Up until this point, the fact that a and L are real numbers allowed 
us to discuss e-neighborhoods of L and 6-neighborhoods of a. With our current 
definition of neighborhood, it makes no sense to talk about a neighborhood of 
infinity, unless of course we decide to give this term meaning. We will do precisely 
this. In fact, as we go, we will concoct extended, new meanings for old language 
and revamp some of our visual imagery to show that there just might be some way 
to understand infinity in a way that we can almost treat it as a number. 

The language we will use in replacing a or L with too will go something like 
this. In discussing lim,-, +. = L, we call these limits at positive or negative infinity. 
When we discuss lim,_,4 f(x) = +00, we call these limits of positive or negative 
infinity. Graphically, a limit at infinity corresponds to a horizontal asymptote in the 
graph of f, and a limit of infinity corresponds to a vertical asymptote. There are 
buckets and buckets of ways to combine and specialize the ideas we will discuss 
here. With both +oo and —oo, with either a or L or both being replaced by these 
symbols, with two-sided and one-sided limits, we can define a whole bunch of new 
terms. By hitting a few, you will catch on to what the others ought to be, so we will 
not be exhaustive. 


Limits at Infinity 


Let’s begin by replacing a with +00, because it almost exactly replicates the theory 
of sequences. Since a sequence is just a real-valued function whose domain is the 
positive integers, our way of graphing a sequence as we did in Figure 6.1 makes 
convergence as n — oo easy to visualize as a horizontal asymptote of discrete 
points in the plane. If we imagine filling in values of the function to other real 
numbers so that the domain is some interval (a, +00), then the following definition 
seems to be a natural adaptation of Definition 6.2.1. 


Definition 7.4.1 Suppose f is defined on some interval (a, +00). Then we say 
limy-+ +00 f(x) = L provided for all « > 0, there exists a real number M such that 
x > M implies | f(x) — L| < «. 


Notice the similarity between Definitions 7.4.1 and 6.2.1. Given € > 0, there 
exists a threshold point in the domain beyond which all values of the function fall 
in the e-neighborhood of L. This definition gives rise to a whole slew of theorems 
involving limits of functions at +oo, where the proofs are identical to those for 
sequences. 


Theorem 7.4.2 lim;,.40c=c. 


Theorem 7.4.3 limy-.+400 1/x = 0. 
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Theorem 7.4.4 Suppose f(x) > Li and g(x) > L2 as x > +00. Then the 
following hold as x > +00. 


1. fy—+ eg) > 114+ 12 
2. fixg(x) > L1L2 
3. f(x)/g(x) > L1/Lz2 (if Lz 4 0) 


On and on the theorems go that exactly parallel our previous work. The limit 
of polynomial over polynomial, and even a sandwich theorem, seem strangely 
translucent. 


EXERCISE 7.4.5 Create a definition for lim,+-o f(x) = L for a function 
defined on an appropriate interval. 


Now let’s extend our language of neighborhood to include infinity. When we 
say f(x) > Las x — a,we mean that any neighborhood of L has a corresponding 
deleted neighborhood of a that maps into it. Is there a way to use the same 
language for a limit at infinity? Could we say that any neighborhood of L has a 
corresponding neighborhood of +oo that maps into it? We can of course, if we 
define a neighborhood of +00 to be an interval of the form (M, +00). Similarly, a 
neighborhood of —oo could be defined as an interval of the form (—oo, M). 

We are catching a glimpse of the extended real numbers and are developing an 
imagery of two phantom points too somewhere way off the left and right ends of 
the number line. This is the standard way of creating the extended real numbers, 
which are denoted as either R or [—00, oo]. 

An apparent difference between neighborhoods of a real number and neigh- 
borhoods of +00 is that the latter are not two-sided. However, if we use the single 
symbol oo instead of both +oo, then we can create a nice way of visualizing the 
extended real numbers, where +oo and —oo are merged into one point. Here is 
one way we might do that, which is similar to the extension of the xy-plane to 
include the point at infinity in complex analysis. 

Imagine a real number line with a circle sitting on top of it as in Figure 7.5. 
We map a given real number a to a corresponding point (x, y) on the circle with 
the help of the diagonal line in the figure. This geometric way of mapping each 


(~%Y) 


Figure 7.5 Mapping the extended real numbers onto a circle. 
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real number to a point on the circle suggests a one-to-one function from the real 
numbers onto all points of the circle except the north pole (0, 1). If you want an 
explicit formula for the coordinates of the point (x, y) in terms of the value of a, 
you can get it easily enough with the help of similar triangles and the equation for 
the circle, but it is not necessary to understand the principle. 


EXERCISE 7.4.6 Use similar triangles and the equation of the circle from 
Figure 7.5 to determine the xy-coordinates of the image of a real number a under 
the bijection that sends the real numbers to all points on the circle except (0, 1). 


With this new imagery, the real numbers are no longer conceived as an infinite 
line but as a circle, except that one point of the circle has not been associated with 
a real number in this mapping. We call the point at (0, 1) the point at infinity. 

The extension of the real numbers to R by introducing positive and negative 
infinity as strange “end points” of the number line is the more common conception 
of the extended real numbers. The point is that oo are nothing more than formal 
symbols thrown in, along with a bunch of algebraic rules about how you are 
supposed to use them. We have therefore arrived at the place where lim,., f(x) = 
L has meaning for all real numbers L and all —oo < a < +00. 


Limits of Infinity 


Now let’s replace L with too in the expression limy¢ f(x) = L. 


Definition 7.4.7 Suppose f is defined on some deleted neighborhood of a. We 
say lim,+a f(x) = +00 provided for all M > 0, there exists 6 > 0 such that 0 < 
|x —a| < dimplies f(x~) > M. 


Definition 7.4.7 is a two-sided definition, so the graph of a function f for which 
limy+a f(x) = +00 will have a vertical asymptote at a, both sides of which head 
upward. 


Example 7.4.8 Show lim,_,9 1/x? = +00. 


Solution Pick M > 0. Let 6 =1/M. Then if 0 < |x| < 6, it follows that 


1 1 1 
= z= 35> 35=M 7.14 
Fa) xe [xl = 2 ( ) 


EXERCISE 7.4.9 In the spirit of Definitions 7.4.1 and 7.4.7, create definitions 
for the following statements. 
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(a) limy+a f(x) = —00 
(b) lim,_.4+ f(x) = +00 
(c) lim,_,a- f(x) = +00 
(d) limy++00 f(x) = +00 


(ec) Tim +00 S() = —-C 
(f) lim, —o0 Sx) = +00 
(g) lim, —o0 Sx) = -C 


EXERCISE 7.4.10 Givena function f, limysa f(x) = +00 if and only if 


lim, f(x) = lim f(x) = +00 (7.15) 


What other kinds of theorems can we expect for limits of infinity? Can we prove 
something that resembles Theorem 7.4.4? The answer is yes, but the theorems will 
look different because the limits are not necessarily real numbers. Consequently, 
we have to begin with Definition 7.4.7 and do some of the work from scratch. The 
parts of the next exercise are ordered so that some of the later ones follow quickly 
from the earlier ones. This should make your work a little more efficient. These 
results and those in Exercise 7.4.12 could just as easily be stated and proved in 
terms of one-sided limits. 


EXERCISE 7.4.11 Suppose f(x) > L, g(x) ~ +00, and h(x) > +00 asx > a. 
Then as x > a, 

(a) —g(x) > -00 

(b) 1/g(%) > 0 

(c) f(x) + g(x) > +00 

(d) fg) > +o if L > 0 

(e) fadgx) > -oif L <0 

(f) Noconclusion can be drawn about f(x) g(x) if L = 0. 

(g) f(x)/g@) > 0 

(h) g(x) +h(x) > +00 

(i) g(x)h(x) > +00 

(j) Noconclusion can be drawn about g(x) — h(x) or g(x)/h(x). 


If f(x) > 0 as x > a, then we can say something about 1/f(x) under certain 
circumstances. 


224 


7.5 


7.5.1 


Chapter 7 Functions of a Real Variable 


EXERCISE 7.4.12 Suppose there exists a deleted neighborhood of a such that 
f(x) > 0 for all x in the deleted neighborhood. Suppose also that f(x) > 0 
as x > a. Then 1/f(x) > +00. Similarly, if f(x) <0 for all x in the deleted 
neighborhood and f(x) > 0, then 1/f(x) —~ —oo. 


In the same way that we use the notation x > at and x > a” to mean x 
approaches a from the right and left, respectively, we can create a shorthand 
notation for functions that behave as in Exercise 7.4.12. We write f(x) > 0* to 
mean that f approaches zero and is positive on a deleted neighborhood of a, and 
we say f approaches zero through positive values. Similarly we write f(x) > 07 
to mean that f approaches zero and is negative on a deleted neighborhood of a, 
and we say f approaches zero through negative values. 


EXERCISE 7.4.13 Let f(x) = (x? — 4)/(x — 1) for all real numbers x 4 1. Find 
with verification lim,_,;- f(x) and lim,_,1+ f(x). 


Continuity 


The word continuous may already be a part of your mathematical vocabulary. 
Perhaps your calculus class delved into continuity enough to provide an e-6é defi- 
nition. More than likely, your notions of continuity are probably best summarized 
as a belief that a continuous function can be sketched without lifting your pen- 
cil off the paper. The graph is one clean, easily drawable piece. Even though 
such a view can be helpful in your understanding of some characteristics of 
continuity, it is far from true that all continuous functions are so easily draw- 
able. The bizarre examples of undrawable continuous functions will come later 
in your study of mathematics. For now, we define the terms and study the basic 
results. We begin with continuity at a single point in the domain, and then we talk 
about continuity on a subset of the domain. In the same way that we talk about 
left-hand and right-hand limits, we will then talk about left continuity and right 
continuity. 


Continuity at a Point 


If limy—a f(x) exists, it describes the behavior of f near a but not at a. It is possible 
that lim,, f(x) = L, while f(a) might either fail to exist or exist and be different 
from L. If a is in the interior of the domain of f, if lim,_,, f(x) exists, and if this 
limit is the value of f(a), we give this phenomenon a name. 


Definition 7.5.1 Suppose f : S > R is a function and a € Int(S). We say that 
f is continuous at a provided 


lim fo) = fla) (7.16) 
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If f is not continuous at a, we say that f is discontinuous there or that f has a 
discontinuity at a. 


Let’s reword Definition 7.5.1 in the language of ¢€, 6, and neighborhoods. 
Continuity is a small step logically from limit, for all we do is delete the word 
deleted when we discuss neighborhoods of a. Definition 7.5.1 can be reworded to 
say that f is continuous at a provided 


(We > 0)(G6 > 0)(Vx € S)(\x — al < 6 > | f(x) — f(a| < 9 (7.17) 


Compare (7.17) to the symbolic form of the definition of limit in (7.2). The only 
difference is that |x —a| <6 replaces 0 < |x —a| < 6. And since |x —a| <6 is 
weaker than 0 < |x —a| < 6, continuity is therefore stronger than the existence 
of a limit. 

Because of the short step from limit to continuity, our work with limits in 
Section 7.2 takes us a long way in the theory of continuous functions. If we 
go back to the results from Section 7.2, replacing L with f(a) and converting 
deleted neighborhoods to neighborhoods, we arrive immediately at the following 
theorems. 


Theorem 7.5.2 The constant function f(x) =c is continuous at every real 
number. 


Theorem 7.5.3 The identity function is continuous at every real number. 


Theorem 7.5.4 If f is continuous at a point, then there exists a neighborhood 
of the point where f is bounded. 


Theorem 7.5.5 If f and g are both continuous at a point, then so are f + g, fg, 
and f — g. 


Theorem 7.5.6 Suppose g is continuous at a and g(a) 4 0. Then there exists 
a neighborhood of a where g can be bounded away from zero. In particular, if 
g(a) > 0, there exists M > 0 and 6 > 0 such that |x — a| < 6 implies g(x) > M. If 
g(a) < 0, there exists M < 0 and 5 > 0 such that |x — a| < 6 implies g(x) < M. 


Theorem 7.5.7 If gis continuous at a and g(a) £ 0, then 1/g is continuous at a. 


Theorem 7.5.8 If f and g are both continuous at a and if g(a) 4 0, then //g is 
continuous at a. 


Corollary 7.5.9 A polynomial function is continuous at every real number. 
Every rational function f(x) = Pi(x)/P2(x), where P; and P2 are polynomials, 
is continuous at every real number a for which P2(a) # 0. 


Theorem 7.5.10 A function f is continuous at a if and only if every sequence 
(an) such that a, — a satisfies f(a,) > f(a). 
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Notice that the hypothesis condition of Theorem 7.5.10 does not require a, 4 a 
as Exercise 7.3.6 does. For if any of the a, =a, then f(a,) = f(a), so that the 
sequence (f(a,,)) is defined for all n. 

Here is an important result we could not address with limits alone. A com- 
position function go f is continuous at a point if f and g are continuous at the 
right points. To prove this, an arbitrarily chosen e-neighborhood of g[ f(a)] pro- 
duces a 51-neighborhood of f(a), and this 51-neighborhood of f(a) produces a 
59-neighborhood of a. 


EXERCISE 7.5.11 Suppose f is continuous at a and g is continuous at f(a). 
Then go f is continuous at a. 


Another way to write what Exercise 7.5.11 says is 
lim g(f@)) = g(lim flx)) = g(fdim ») = g(f@) (7.18) 


With a little more mathematical machinery than we have up to now, Eq. (7.18) 
comes in handy in certain manipulations of limits. If we assume for the moment 
that all the functions below are continuous at the points involved, Eq. (7.18) allows 
us to write something like 


lim ¥1+e-" = ’ lim (1 + e-*) 
x33 x>3 


= /14+ lime” 
x23 (7.19) 


=y 1 + elim,.3(—x?) 
=VJl+e? 


We could not have a theorem in Section 7.2 that said something like “If f(x) > 
Ly asx > aand g(x) > L2 as x > 1}, then (go f)(x) > L2 as x > a” because 
a possible hole in the domain of g at f(a) might cause troublesome gaps in the 
domain of go f around a. 


Example 7.5.12 Let f(x) = xcos(1/x) and g(x) = sinx/x, and let a= 0. Now 
cos(1/x) is bounded, so that f(x) > 0 as x — 0. Also, by the presence of cos(1/x), 
f(x) = 0 for infinitely many values of x in every neighborhood of zero. Thus 
(go f)(x) fails to exist at infinitely many points in every neighborhood of zero 
because g(0) is not defined. Mf 


If f is discontinuous at a point it could be for one or more of three basic 
reasons. If Eq. (7.16) is not satisfied it might be that 
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1 2 3 4 5 6 


Figure 7.6 Some examples of discontinuities. 


(D1) lim,—.¢ f(x) does not exist; 
(D2) f(a) does not exist; or 
(D3) lim,—.¢ f(x) and f(a) both exist, but are not equal. 


Figure 7.6 illustrates some of these possibilities at a=1,2,3,4,5. At a=1, 
f behaves much like sin(1/x) near x = 0. (See Exercise 7.2.21.) 


EXERCISE 7.5.13 For the function sketched in Figure 7.6, state which of the 
characteristics D1—D3 describe the discontinuities at x = 1, 2,3, 4,5. 


The discontinuities at 2 and 4 in Figure 7.6 are called removable discontinuities 
because it is possible to define or redefine the value of f to make it continuous 
there. The discontinuity at 5 is called a jump discontinuity. 


Example 7.5.14 The function f(x) = sinx/x has a removable discontinuity at 
zero. Thus the function 


sin x ifx £0 
—— x 

gxy= 4} x (7.20) 
1, ifx=0 


is continuous at zero because lim,_.9 f(x) =1= f(0). Hf 


Example 7.5.15 Thesalt and pepper function (Example 7.2.19) is discontinuous 
at every real number, for lim,_., f(x) fails to exist for every realnumbera. 


The next exercise presents a well-known function that is related to the salt and 
pepper function. It provides an interesting first glimpse into functions in which 
continuity and discontinuity can coexist in functions whose behavior might not 
seem so odd at first glance. 
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EXERCISE 7.5.16 Define f: 0,1) — Rin the following way. 


, ifx = p/q is rational (p and q relatively prime) 


1 
fe) =24 (7.21) 
0, 


if x is irrational 

Then f is continuous at every irrational and discontinuous at every rational.! 
Continuity on a Set 

The definition of continuity on a subset of the domain is pretty straightforward if 


the subset is open, for then we are guaranteed that every point is contained in a 
neighborhood entirely within the domain of the function. 


Definition 7.5.17 If f : S— R is a function and A is an open subset of S, we 
say f is continuous on A provided it is continuous at every point in A. Logically, 
we may write this as 


(Va € A)(We > 0)(A6 > 0)(Wx € A)(|x — al < 5 > | f(x) — f@| < 9 (7.22) 


Notice that the only difference between statements (7.17) and (7.22) is that 
the latter begins with (Va € A). To construct a proof that f is continuous on a set 
A by going back to € and 6, you would begin by picking both a € A and € > 0. We 
want to do precisely this in an example now. We want to show that f(x) = 1/x is 
continuous on (0, +00). Yes, it can be said that continuity of f(x) = 1/x on (0, +00) 
follows from Theorem 7.5.9. But we are trying to illustrate that finding a value of 
6 will depend not only on the value of €, but on the arbitrarily chosen point in the 
domain at which we are working. 

To begin the construction of a proof that f(x) = 1/x is continuous on (0, +00), 
pick a > 0 and e > 0. As usual in an «-é proof, we must find 5 > 0 so that the 
inequality 


Ix —al < 6 (7.23) 


will be at least as strong an inequality as 


<€ (7.24) 


So we work backwards from (7.24), trying to transform it into something of the 
form (7.23), making sure that the steps we take do not produce inequalities that 
get any weaker. One way to rewrite (7.24) is 


! For any positive integer n, there are only finitely many rational numbers with denominator g <n. 
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la—x|_ |x—al 


(7.25) 


|xa| |x| Jal 


Now ais a given positive number, and provided we choose 6 to be sufficiently 
small, x will be sufficiently close to a and therefore be positive also. Furthermore, 
to make inequality (7.25) true, we must bound |x — a| from above, and we must 
bound |x| from below by some number, so that we can replace |x| with this number 
in the inequality. We must take both of these considerations into account to find 
6, and here is one way to do that. 

First, we can insist that the neighborhood around a be sufficiently small so 
that x > a/2. This is equivalent to a — x < a/2, so we insist that 5 < a/2. Second, 
assuming x > a/2, we would have 1/x > 2/a, to have 


|x — al 


a2 /2 


<e€ (7.26) 


Multiplying this through by a”/2 suggests that |x — a| needs to be smaller than 
ea’ /2. Thus if we let 5 = min{a/2, ea*/2}, then we are sure that 4 is positive and 
that inequality (7.25) will be satisfied. A proof that 1/x is continuous on (0, +00) 
would then go something like this. 


Proof. Let a and « be given positive numbers. Let 5 = min{a/2, ea”/2}, which 
is clearly positive. Then if |x — a| < 6, we note first that x > a — 6 > a/2 because 
6 < a/2. Furthermore, since 6 < ea /2, 


1 1 


xX a 


_ ja—x| _ |x-al |x —al 
lxa| —s |x| a a? /2 


<e€ (7.27) 


Thus 1/x is continuous at a. 


Notice a very important fact. The value of 5 we proposed does depend on both 
€ and a. In fact, the closer a is to zero, the smaller our value of 6 will be. On the 
basis of our work, it appears there is no single value of 6 that would depend only 
on € and work for all positive a. It might occur to you, however, that someone else 
might have taken a different approach and stumbled across a value of 6 that was 
not tied to the value of a but depended only on e. 

The question is an important one and will be answered when we study uni- 
form continuity in Section 7.7. Suffice it to say for now that no such 6 as a 
function of € alone exists for f(x) = 1/x on (0, +00). Intuitively, it might seem 
plausible when we consider the vertical asymptote of 1/x at zero. In Figure 7.7, 
imagine setting an e-tolerance around a point 1/a on the y-axis. If a is very 
close to zero, the graph of f is very steep at the point (a,1/a) and varies a 
lot even in a small neighborhood of a on the x-axis. Granted, for any a there 
is a 6-neighborhood of a that maps into N,[f(a)]. But if you imagine smaller 
and smaller values of a, then the asymptote of the graph necessitates smaller 
and smaller values of 6. No single 5-value will work for all a because of this 
asymptote. 
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/ate pH 


a 


F(x) = 1/x. 


The scratchwork you will go through in the next exercise is not nearly as 
involved as it has been for our example here, but you will learn a bit more about 
using the inequality f(a) — « < f(x) < f(a) + € and working backwards to find an 
inequality of the form a — 6 < x < a+ 6 that is at least as strong. A peculiar thing 
for ./x on (0, 00), however, is that it actually is possible to find a 6 that is indepen- 
dent of the value of a. Though it is true that the graph of ./x gets very steep as 
x approaches zero from the right, it does not behave asymptotically there. Why 
such a 6 can be found will become clear in Section 7.7. 


EXERCISE 7.5.18 Use an <6 proof to show that f(x) = ./x is continuous at 
anya> 0. 

One-Sided Continuity 

Even if lim,+¢ f(x) does not exist, there might still be a hope that either 


lim,_,,- f(x) or lim,_,,+ f(x) exists. Similarly, though f might not be continuous 
at a, we might have continuity from one side or the other. 


Definition 7.5.19 A function f is said to be left continuous at a provided 
lim f(x) = fi@ (7.28) 
x7 aT 
Similarly, f is said to be right continuous at a provided 


lim, fe) = fla) (7.29) 
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An immediate consequence of Exercise 7.3.2 is the following. 


Theorem 7.5.20 A function f is continuous at a if and only if it is both left and 
right continuous at a. 


In defining continuity on a set, Definition 7.5.17 stipulated that the set must 
be open. This allowed for a definition of continuity on (a, b), but not on [a, b]. 
One-sided continuity now allows us to define continuity on [a, b] in such a way 
that we do not concern ourselves with how f behaves or whether it even exists 
outside of [a, b]. 


Definition 7.5.21 A function f is said to be continuous on [a, b] provided the 
following hold: 

1. f is continuous on (a, b), 

2. f is right continuous at a, 


3. f is left continuous at b. 


Definition 7.5.21 can be naturally adapted to apply to the following example 
by omitting stipulation 3. 


Example 7.5.22 f(x) = ./x is continuous on [0, +00). From Exercise 7.5.18, 
f is continuous at all positive real numbers, and by Exercise 7.3.5, it is right 
continuous at 0. 


Implications of Continuity 


Continuity has many implications, and in this section we look at three of them. 
The second of these results is actually logically equivalent to continuity. 


The Intermediate Value Theorem 


The imagery that we can sketch a continuous function without picking up the 
pencil makes this first result seem plausible. It says that a continuous function 
cannot be negative at one point and positive somewhere else without crossing the 
x-axis somewhere between the two points. One case of this is stated in the next 
exercise. To prove it, imagine standing on the x-axis at a and looking up the x-axis. 
There is a natural subset of the real numbers that is both non-empty and bounded 
from above by b to which the LUB property can apply. 


EXERCISE 7.6.1 Suppose a < b, f(a) < 0 < f(b), and f is continuous on [a, b]. 
Then there exists c € (a, b) such that f(c) = 0.2 


2 LetA= {x € [a, b] : f(x) < 0}. What can you say about A? 
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If gis a continuous function such that g(a) > 0 > g(b) for some a < b, then we 
can apply Exercise 7.6.1 to —g to produce some c € (a, b) such that g(c) = 0. 

Exercise 7.6.1 provides most of the machinery needed to prove a theorem 
whose name you might remember from calculus, the Intermediate Value Theorem 
(IVT). We will supply the proof here to illustrate a convenience, where a specific 
result like Exercise 7.6.1 can generalize very easily. 


Theorem 7.6.2 (Intermediate Value Theorem). Suppose f is continuous on 
[a, b], and suppose f(a) < f(b). Let yo be any real number satisfying f(a) < yo < 
f(b). Then there exists c € (a, b) such that f(c) = yo. 


In the IVT, f does not necessarily change from negative to positive as x runs 
from a to b. However, if we create a new function g by dropping or raising f to 
make g(a) < 0 < g(b), then we can apply Exercise 7.6.1 to g and see what it says 
about f. 


Proof. Define g(x) = f(x) — yo. Since f and the constant function yo are both 
continuous on [a, b], Theorem 7.5.5 and our work with one-sided continuity imply 
that gis continuous on [a, b]. Furthermore, g(a) = f(a) — yo < Oand g(b) = f(b) — 
yo > 0. By Exercise 7.6.1, there exists c € (a, b) such that g(c) = 0. Thus f(c) = yo. 


EXERCISE 7.6.3. Prove the fixed point theorem: If f : [a, b] > [a, b] is a contin- 
uous function, then there exists c € [a, b] such that f(c) = c. 


EXERCISE 7.6.4 The fundamental theorem of algebra says every polynomial 
function whose degree is odd has a real root. That is, if P(x) = dopa t + 
d2nx2" +--+ + a,x +o, then there exists a real number c such that P(c) = 0. In 
this exercise you prove the fundamental theorem of algebra, first for the case 
a2n41 > 0 and then for the case a2,41 < 0. 


(a) Use your definitions from Exercise 7.4.9 to prove 


lim x=+0oo and = lim x=—oo (7.30) 
X—> +00 X—>—0OoO 


(b) With part (a) in hand, a result like Exercise 7.4.11 would be demonstrable 
for x + too by paralleling its proof. Assuming this result, show that P(x) as 
defined above and with a2,+41 > 0 satisfies 


lim P(x)=-—oo and = lim P(x)=+00 (7.31) 
X—>—0O X—> +00 


Consider the function g(x) = f(x) — x. If either g(a) or g(b) is zero, you’re done. Otherwise, apply 
the IVT. 
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(c) Use your result from part (b) to prove the fundamental theorem of algebra 
for the case azy,41 > 0. 


(d) Prove the fundamental theorem of algebra for the case a2, +41 < 0 by applying 
part (c) to — P(x). 


Theorem 7.1.7 and Corollary 7.1.9 demonstrated that strict monotonicity of 
a function on a set implies that it is one-to-one (hence invertible on the image 
of the set). The converse of Corollary 7.1.9 is clearly not true, as is illustrated by 
f(x) = 1/x on the nonzero real numbers, which is one-to-one but not monotone. 
However, if f is one-to-one and continuous on a set, then it must be strictly 
monotone. The IVT will come in handy in the proof of the next result. 


EXERCISE 7.6.5 If f is continuous and one-to-one on [a, b], then f is strictly 
monotone on [a, b.45 


The IVT can also help us prove that a continuous, invertible function has a 
continuous inverse. Specifically, if f is continuous and invertible on [a, b], then it 
is continuous and one-to-one. By Exercise 7.6.5, f is strictly monotone on [a, b] 
also. If f is strictly increasing, then f(a) < f(x) < f(b) for all x € (a, b). By the 
IVT, every y € [ f(a), f(b)] has a pre-image in [a, b], so that f([a, b]) = [ f(a), f(b)]. 
By similar reasoning, if f is strictly decreasing, f([a, b]) = [f(b), f(a]. In either 
case, f([a, b]) = [c, d] for some c < d. To show f~* is continuous on [c, d], you will 
have to verify all three criteria in Definition 7.5.21. 


EXERCISE 7.6.6 Suppose f is continuous and invertible on [a,b]. Then f~! : 
f(a, b]) = [a, b] is continuous on f([a, b)). 


Continuity and Open Sets 


The logical equivalence of the e-5 form of continuity and the sequential limit form 
conveyed by Theorem 7.5.10 are useful, not only because they give us freedom to 
exchange one property for another, but also because they suggest an alternative 
way to define continuity that might be preferred in the creation of some mathe- 
matical structures. Exercise 7.6.7 states another feature that is logically equivalent 
to continuity, this time in terms of the pre-images of open subsets of the codomain. 
In the theorems that follow, we are going to assume that the functions involved 
are defined on the entire set of real numbers. This will keep the proofs relatively 
simple and get the point across, though similar theorems can be addressed on 
restricted domains. 


See Exercise 1.2.18(i). 
Follow your nose. For f not monotone and f(a) < f(b), spend some time showing there exist cy < 
e2 <3 such that f(cy) < fle2) > f(e3) or ley) > lez) < fies). 
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When you prove Exercise 7.6.7, you can write a very elegant proof if you 
will use a slightly different form of the definition of continuity than that in 
expression (7.17). The following statements are all equivalent: 


In —a| <8 > |f) — f@| <« 
x € Ns(a) > fix) € Nel flay] 
FINs(a)] & Nel flay] (7.32) 


If you will use (7.32) as you apply continuity, you will find that Exercises 4.3.6, 
4.3.10, and 4.3.13 make for some pretty nifty manipulations of the sets involved in 
the proof. 


EXERCISE 7.6.7 A function f :R-— R is continuous if and only if the pre- 
image of every open set is open. 


With Exercise 7.6.7, the following should drop right into your lap. 


EXERCISE 7.6.8 A function f :R-— R is continuous if and only if the pre- 
image of every closed set is closed.® 


Let’s digress just a moment to comment on the significance of Exercise 7.6.7. 
First, the logical equivalence of the two statements in Exercise 7.6.7 suggests 
yet another place one might begin in defining continuity of a function. If one 
were to begin by defining a function to be continuous provided the pre-image of 
every open set is open, then our e-6 definition and the sequential limit property of 
Exercise 7.3.6 would become theorems in the analysis of real numbers. If it seems 
a little unnatural to begin with such an open set pre-image definition, consider the 
following, which is a glimpse into topology. 

At the beginning of Chapter 5, we said that a defining characteristic of analysis 
is that elements of a set have either a measure of size (norm) or of distance between 
them (metric). In the real numbers, the measure most commonly used is absolute 
value, so that |x| is the size of x and |a — D| is the distance between a and b. If aset 
is endowed with a metric, then a neighborhood of a is defined as all points within 
a certain distance of a. Then a definition of open set like Definition 5.3.2 becomes 
meaningful because openness is defined in terms of neighborhoods. 

In topology there is no such notion of size or distance either defined or assumed 
on the set. Instead, we begin with the idea of open set in what might seem a peculiar 
way. Given a set S, we declare some subsets to be open just because we say so. If 
you want to declare every subset of S to be open, fine. However, that might not 
prove to be especially interesting. To prevent your unbridled freedom to define 
open sets from degenerating into mathematically useless anarchy, your collection 
of open sets needs to have some of the same properties of open sets we derived 
as theorems for the real numbers. Specifically, if we lump all the subsets of S that 


® See Exercise 4.3.11(c). 
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we declare to be open into the family O, then in forming a topology we insist on 
the following. 


(T1) Both Z and S are sets in the family O. 


(T2) If F C O is a collection of open sets, then (),-¢ A is open. That is, O is 
closed under union. 


(T3) If {Ag : 1 <k <n} isa finite collection of open sets, then (;_, Ax is open. 
That is, O is closed under finite intersection. 


Closed sets are then defined to be those whose complement is open, and you are 
on your way to building a structure that will have some parallels to those we have 
studied for the real numbers. But the structure will be more abstract and austere 
because the definition of open set does not probe as deeply into some assumed 
structure of the set. 

Exercise 7.6.7 will also help you prove the following. 


EXERCISE 7.6.9 The continuous image of a compact set is compact. That is, if 
g p p 
f :R— Ris continuous on a compact set S, then f(S) is compact. 


In Exercise 5.4.7 , you showed that if L is the LUB of aset Sand L ¢ S,then L 
is a cluster point of S. Since a compact set is bounded, it has an LUB, and since it is 
also closed, it contains all its cluster points. Thus a compact set contains its LUB. 
Similarly, a compact set also contains its GLB. If f : R > R is continuous and S$ 
is compact, then f(S) is compact by Exercise 7.6.9. Consequently, f(S) contains 
its LUB and GLB. Write M = max[f(S)] and m = min[ f(S)]. Then there exist 
x1,x2 € S such that f(v1) = m and f(x2) = M. We have just proved the following 
theorem. 


Theorem 7.6.10 (Extreme Value Theorem (EVT)). If f : R > Ris continu- 
ous on a compact set S, then f attains a maximum and minimum value on S. 


EXERCISE 7.6.11 Give two examples to illustrate that compactness is necessary 
for the EVT to apply to a continuous function, one example where S$ is bounded 
but not closed, and one where S is closed but not bounded. 


Uniform Continuity 


In Section 5.2 we used a metaphor involving diseases, medication, and panaceas 
to distinguish between every question having an answer and the existence of a 
single answer that applies to every question. From Definition 7.5.17, continuity of 
a function f ona set A can be written logically as 


(Wa € A)(We > 0)(A5 > 0)(Wx € A)(\x —al <5 > |f() — f@| <2) (7.33) 


In general, the value of 6 will depend on both «€ and a. In this section, we want to 
define a form of continuity in which 6 can be found that depends only on ¢ and 
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works for alla € A. We will look at some examples to illustrate the point, and then 
we will study two important theorems. 


Definition and Examples 


Let’s take the definition of continuity in (7.33) and leapfrog the first component 
piece (Va € A) two jumps to the right. 


(We > 0)G6 > 0)(Wa € A)(Vx € A)(|x — al < 6 > | f(x) — f@| < © (7.34) 


Moving (Va € A) to the right of (Ve > 0) does not change anything logically 
from (7.33). However, moving (Va € A) to the right of @é > 0) makes a big dif- 
ference. If you were writing a proof of continuity from (7.33), you would begin by 
picking a € A ande > 0. Then, with both a and ¢€ in hand, you would go hunting for 
6 > 0 with the required properties. But with the phrase (Va € A) repositioned as 
in (7.34), things are different. If you were writing a proof of a theorem where (7.34) 
was involved, you would begin by picking only € > 0. Then, without knowing any 
particular value of a € A, you would have to find 6 > 0 from « alone, and this 6 
would have to serve for all a € A. So (7.34) suggests that the value of 5 can be 
found after having specified only ¢, and this 5-value will work for all a, x € A satis- 
fying |x — a| < 6. Because (7.33) specifies a first, we think of a as being the center 
of a 5-neighborhood and x as an arbitrary point in that neighborhood. There is 
no reason to call one point a and the other one x, as if one were fixed before the 
other. The point is that there exists 5 > 0 such that if any two points are within 6 
of each other, then their functional values are within € of each other. For clarity 
and convenience, we change the symbols slightly in the following definition. 


Definition 7.7.1 A function f : S > R is said to be uniformly continuous on 
A C Sprovided for alle > 0, there exists 6 > Osuch that, for allx, y € A, |x — y| <6 
implies | f(x) — f(y)| < €. Logically, we may write this as 


(We > 0)(45 > O)(Vx, y € A(x — yl < 8 > [f@) — FO) < 9 (7.35) 


To show a function f is uniformly continuous on A, we work backwards from 
| f(x) — f(y)| < ¢. Watch what happens in the next example. 


Example 7.7.2 Show that f(x) = x*/(x + 1) is uniformly continuous on [0, 00). 


Solution Before we write a demonstration, we need to do some scratchwork. If 
you play with the expression | f(x) — f(y)| < € and try to factor |x — y| out of it, 
you can arrive at the following. 


y? 


x+1 y+l 


xy+tx+y 
(«+)Dyv+1 


If) — fo) = (7.36) 
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Next, let’s split up the fraction in the right-hand side of Eq. (7.36) and apply 
the triangle inequality. Also, notice that x, y > 0 implies that 1/(x + 1),1/(@+ 1), 
x/(x +1), and y/(y + 1) are all less than 1. So we have 


xy+x+y 
(«+)Dvt+1)|7— 


xy | 
«+Do+) 


XxX 
: aeboTo|* ae 


S mallet! mo leen| 


| <3 
(yt DD} /@+D]— 


(7.37) 


Having arrived at | f(x) — f(y)| < 3|x — yl, we are ready to write a proof. 
Let « > 0 be given, and let 5 = €/3. Then for any x, y > O such that |x — y| < 6, 
we have 


2 2 


10) — F091 =| z 2 aE) 


(«+)yv+l 


= |x — yl 


xt1 yt+l 


< |x vi( 


+ 


(x +1) | | (y+ 1) | (7.38) 


a) 
(x + 1) 


+ 
eraulicean! ees 
<3|x-—y| <36=€ 
a 
The scratchwork of Example 7.7.2 suggests the following more general result. 


EXERCISE 7.7.3 If there exists m > 0 such that | f(x) — f(y)| < m|x — y| for all 
x,y € A, then f is uniformly continuous on A. 


If y 4 x, the hypothesis condition of Exercise 7.7.3 is equivalent to 


a fx) — 


(7.39) 


which means that f has a bound on the slopes of lines through any two points on 
its graph. Loosely speaking, if the steepness of f (as measured by slopes of secant 
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lines) is bounded, then f is uniformly continuous. The converse of Exercise 7.7.3 
is not true. Shortly, we will point out a function f that is uniformly continuous on 
a set A, but for which inequality (7.39) does not hold across A for any m > 0. 

Naturally, we want to include a demonstration that a continuous function need 
not be uniformly continuous on a set. 


EXERCISE 7.7.4 What does it mean for f not to be uniformly continuous on A? 
Example 7.7.5 Show that f(x) = 1/x is not uniformly continuous on (0, 1). 

Solution First, we do some scratchwork. The vertical asymptote at x = 0 will 
provide us with x and y values that are very close together but whose functional 


values can differ as much as a strategically chosen e-value. We have to play with 
the inequality 


1 |= 2 = 


ae 


(7.40) 


and find some ¢€ > 0 so that, regardless of 5 > 0, we can find two points x and y 
that are within 6 of each other and satisfy inequality (7.40). No obvious €-value 
jumps out at us, so let’s try e = 1 and see if we can proceed. 

Inequality (7.40) itself suggests a way to find x and y. Whatever we decide to 
let x be, we can let y = x + 6/2 so that |y — x| = 6/2 < 6. Then the trick is to let x 
be sufficiently close to zero so that |x| |y| is small enough to make inequality (7.40) 
true. Furthermore, since smaller values of 5 represent our primary obstacle, we may 
assume 6 is smaller than any convenient positive number, if such an assumption 
appears helpful. If we let x = 6/2 and y = 6, inequality (7.40) falls right into place, 
as long as 6 < 1. Here is our demonstration. 

Let « = 1, and pick any 5 > 0. We may assume that 6 < 1. Let x = 6/2 and 
y = 6. Then |x — y| < 6, and 


= 8/2 1 
Lae are >1l=e (7.41) 


fe) Ola = eps 


Thus f is not uniformly continuous on (0,1). 


In Example 7.7.5, the fact that (0,1) is not closed allows for 1/x to have a 
vertical asymptote at one end point. Even though 1/x is continuous through- 
out (0,1), the asymptote is where we look to disprove uniform continuity. In 
the next exercise, you will need to look among sufficiently large numbers where 
you will find x and y within 6 of each other whose functional values differ by at 
least e. 


EXERCISE 7.7.6 Show that f(x) = x’ is not uniformly continuous on [0, +00). 
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7.7.2 Uniform Continuity and Compact Sets 


Perhaps the most beloved theorem dealing with uniform continuity is the 
following. 


Theorem 7.7.7 If f : S— Risacontinuous function and S is compact, then f 
is uniformly continuous on S. 


There is something about the fact that every open cover of S is reducible 
to a finite subcover that allows us to liberate the value of 6 from any spe- 
cific points in the domain and determine it from € alone. We will supply the 
proof here because it requires a few sneaky shrinkings of neighborhoods. If 
you want to try to prove it on your own, here is a thumbnail sketch of how to 
proceed. 

As usual, we pick e > 0. Then since f is continuous at every point in S, we can 
cover S with a slew of neighborhoods, one centered at each a € S, whose radius is 
half the 5-value that guarantees f[N3(a)] © Ne2[ f(a]. Note that each value of 6 
will depend on a. Since every a is in its own 6-neighborhood, the set of all these 
neighborhoods covers S. Compactness of S then allows us to reduce this cover to a 
finite subcover. These finitely many neighborhoods supply us with a single 5-value: 
the minimum of the finitely many 5/2-values from the subcover. We can then show 
that if |x — y| < 6, then | f(x) — f(y)| < €. See if you can fill in the details. If not, 
here is the whole proof. 


Proof. Lete > 0be given. Since f iscontinuous at every a € S, then for any partic- 
ular a, there exists 5(a) (the notation illustrating that 6 is a function of a), such that 
|x — a| < 6(a) implies | f(x) — f(a)| < €/2. Cover S with the set C = {Nsa/2(@) : 
a € S}. Since every a is in its own neighborhood of radius 5(a)/2, C does in fact 
cover S. Since S is compact, C has a finite subcover Cy = {N5a,)/2(ax) : 1 < k <n}. 
Let 5 = min{é(az)/2: 1 < k < n},andpick x, y € Ssuchthat|x — y| < 6. Nowsince 
C; covers S, there exists some k such that x € N5(a,)/2. Furthermore, y € N5(a,) 
because 


B(ax) _ (ax), Sax) _ 


aa 5 = 84x) (7.42) 


ly — ag] S ly — x] + |x — | < 6+ 


Thus 


€ 


56 (7.43) 


Lf) — FO)! < Lf) — flaw)| + | flak) — fO| < , a 


so that f is uniformly continuous on S. 
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In Exercise 7.5.18 you showed that f(x) = ./x is continuous on (0, +00). Since 
lim,_,9+ /x = 0 = V0, fiscontinuous on [0, 1], and hence is uniformly continuous 
there. However, the graph of f gets very steep as x > 0°, so that (7.39) is not 
satisfied. 


EXERCISE 7.7.8 Show that inequality (7.39) is not satisfied by f(x) = /x on 
[0, 1] for any m > 0. 


This page intentionally left blank 


8.1 


8.1.1 


Groups 


Inits simplest terms, algebra is the study of sets on which binary operations provide 
the defining internal structure for the set. For example, we may construct a set 
and define a form of addition or multiplication, then look at the structure of the 
set and the relationships between its elements that result from these operations. 

Of particular interest in algebra is the study of mappings between sets S; and Sz 
where the structure of the binary operation on S; is preserved among the images 
of the elements in $7. More concretely, if S$; has binary operation *, if Sp has binary 
operation -, andif f is a function from S) to S2, we address the question of whether 
flax b) = f(a): f(b) for all a, b € S,. Whether f is one-to-one or onto also leads 
to some interesting results. 

In this chapter, we begin our study of some of the most basic concepts of 
algebraic structures by starting with groups. Some of the theorems are actually 
restatements of results we have already seen for the real numbers. Now, however, 
the context is broader, more abstract, so the air might seem a little thinner at 
first. Instead of proving algebraic theorems for which we have the real numbers 
specifically in mind, we prove theorems based on assumptions that certainly apply 
to real numbers, but of which the real numbers are only a specific example. 


Introduction to Groups 
Basic Characteristics of Algebraic Structures 


An algebraic structure begins with a non-empty set and builds its internal structure 
in several stages. First there must be some notion of equality either defined or 
understood on the set, which naturally must be an equivalence relation. One of 
the assumptions we stated in Chapter 0 was that equality in the real numbers 
satisfies properties E1—-E3, though we did not really probe into what is behind this 
assumed equality. Our work in Section 3.8 was a great example of how equality can 
be created in a set. Assuming that integer equality is an equivalence relation, we 
built the rationals using the integers as building blocks, where we defined rational 
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equality in terms of integer equality. Then we showed that this definition is an 
equivalence relation. 

Once the set is constructed and equality is defined, we can define one or more 
binary operations. By definition, a binary operation is well defined and closed. 
Let S be a set we have constructed, let = represent equality on S, and let x be a 
binary operation defined on S. To say that the binary operation is well defined is 
to say that for all a,b,c,d € S, ifa=b andc=d, thenaxc=bxd. Note how 
this requirement appeals to equality on S. From this point forward, we will almost 
always use = to represent equality as it is defined on S. 


Example 8.1.1 By assumption, addition and multiplication are well defined and 
closed on the real numbers, so that both are binary operations. Also by assumption, 
the positive real numbers are closed under addition and multiplication. 


Example 8.1.2 By the remarks after Definition 3.8.6, addition is closed on 
the rationals. By Exercise 3.8.7, addition is well defined. Similarly, rational 
multiplication is closed, and by Exercise 3.8.10 it is well defined. 


EXERCISE 8.1.3 Determine with explanation whether each of the following 
defines a binary operation on the given set. 


(a) Addition on the negative real numbers. 

(b) Multiplication on the negative real numbers. 

(c) Addition on the irrational numbers. 

(d) Multiplication on the irrational numbers. 

(e) For a non-empty set U, the operation of union on the power set of U. 

(f) For anon-empty set U, the operation of intersection on the power set of U. 


(g) Greatest common divisor on the positive integers. 


EXERCISE 8.1.4 Fora given non-empty set A, let S be the set of all bijections 
from A to itself. Is composition a binary operation on S? Explain. 


By our work in Section 3.11, a binary operation on S can be defined as a function 
f :S x S — S. The formal notation that would be used to represent addition as 
such a function would be +: S x S > S, writing +(a, b) to mean a+ b. The fact 
that + as a function has property F1 is precisely what it means for the binary 
operation of addition to be closed. That + has property F2 means addition is well 
defined as an operation. Thus the fact that+ : § x S > Sisa function is equivalent 
to addition being a binary operation. 

All the binary operations we will address in this chapter will have the asso- 
clative property. We might have to verify associativity, however, if the context we 
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are working in is new and we have created a binary operation from scratch. Some 
binary operations are not associative, but they are indeed rare. 


EXERCISE 8.1.5 Give an example of a binary operation that is not associative.! 


Many, but not all, of the binary operations we will consider will be commutative. 
Some very interesting results of algebra derive from binary operations that are not 
commutative. Be careful in your work. Unless commutativity is explicitly given 
or proved, you might be tempted to reverse the order of elements without any 
permission to do so. 


EXERCISE 8.1.6 Show that the binary operation in Exercise 8.1.4 is not 
commutative. 


The existence of an identity element for a binary operation is rather context 
specific. In the next definition we insist that an identity element must commute 
with every element of the set, regardless of whether the binary operation is 
commutative. 


Definition 8.1.7 Suppose S is endowed with binary operation «. Then e € S is 
called an identity for the operation * ifa*e=exa=aforallae S. 


If there is an identity element for a given binary operation, then it might be 
that some or all elements have an inverse. 


Definition 8.1.8 Suppose S has binary operation «, for which there is an iden- 


tity element e. If for a given a € S there exists b € S such thataxb=bxa=e, 


then we say that b is an inverse of a, and we write it as a~!. 


For convenience we sometimes list the features of an algebraic structure as 
an ordered list. For example, if S is a set with binary operation x, identity e, and 
with the feature that every a € S has an inverse a~!, then we might write such a 
structure as (S, +, e,~!). 

Of the basic algebraic properties we assumed on the real numbers (A2-A14), 
the only one we have not mentioned yet is the distributive property (A14). Ifa 
set is endowed with two binary operations, it might be that they are linked in their 
behavior by the distributive property. We will address algebraic structures with 
two binary operations in Chapter 9. 

If S is a small finite set, it might be convenient to describe the binary operation 
in a Cayley table. Since a binary operation might not be commutative, it is impor- 
tant to read a « b from a Cayley table by going down the left column to find a and 
across the top to find b. 


1 See Section 4.8. 
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Example 8.1.9 Consider S = {0,1,2,3,4,5} and let © be described as in 
Table 8.1. 


(8.1) 


ARWNF OD 
ARWNFR CIO 
CONRWNHR 
POUR WNID 
NE OWA Vw 
wWBNRFOWN AA 
BRWNFPOUWN 


That © is well defined is immediate, for there is a unique value in each position 
in the table. Closure of @ is also obvious, since every entry in the table is an 
element of S. Hi 


EXERCISE 8.1.10 Is @ is commutative? How can you tell? Is there an identity 
element? Does every element have an inverse? 


One way to verify associativity of 6 would be to do all possible calculations 
of the form (a @ b) @c anda @ (b @ c) to check if they are equal. In Section 8.3, 
we will construct this algebraic structure formally and prove associativity as a 
theorem. 


Example 8.1.11 Let S be the set in Example 8.1.9 and define the operation ® 
as in Table 8.2. 


@|0 12 3 4 5 
0f0 00 0 0 0 
1/0 12 3 4 5 
2/0 2402 4 (8.2) 
2/0 2.0 3 6 3 
4/0 42 0 4 2 
00. 5 Ao SO 
P| 


EXERCISE 8.1.12 Is @ commutative? Is there an identity element? Does every 
element have an inverse? 


Groups Defined 


We have names to refer to algebraic structures with certain sets of features. Here 
is our first such name. 


N 
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Definition 8.1.13 Suppose G is a set with associative binary operation +, 
identity element e, and with the property that every g € G has an inverse g~! 
under «. Then the algebraic structure (G, x, e,~! ) is called a group. If * isa commu- 
tative binary operation, then G is called an abelian group (after the mathematician 
Neils Henrik Abel (1802-1829)). If G is finite, the cardinality of G is called the 
order of the group (usually denoted o(G) instead of |G|). 


According to Definition 8.1.13, a group has the following defining features. 


(G1) The operation « is well defined. 
(G2) The operation «x is closed. 

(G3) The operation x is associative. 
(G4) There is an identity element e. 


(G5) Every element of G has an inverse under x. 


Example 8.1.14 The real numbers with binary operation addition, identity 
zero, and additive inverses form an abelian group: (R, +, 0, —). For G1 is prop- 
erty A2, G2 is A3, G3 is A4, G4 is A6, and G5 is A7. That the group is abelian is 
property A5. Also, the integers and the rational numbers are abelian groups 
under addition. I 


EXERCISE 8.1.15 Determine whether each of the following is a group. If not, 
which of the properties G1-GS fails to hold? 


(a) The nonnegative real numbers with addition. 

(b) The integers with multiplication. 

(c) The nonzero real numbers with multiplication. 

(d) For a non-empty set U, the power set of U with binary operation union. 

(e) Foranon-empty set U, the power set of U with binary operation intersection. 


(f) For anon-empty set U, the power set of U with binary operation symmetric 
difference.” 


Example 8.1.16 The algebraic structure (S, @, 0, —) from Example 8.1.9 is an 
abelian group. However, (S,@,1,~!) from Example 8.1.11 is not a group, for 
some elements do not have inverses. li 


Perhaps you have finally earned the right to say, “Proof by picture” to prove associativity. Ask your 
instructor. 
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Example 8.1.17 Here is another example of a finite abelian group. In 
Exercise 2.2.5, you observed that x* = —1 has no real solution x. Nothing pre- 
vents us from creating a symbol, say i, declaring i* = —1, and then noting that i is 
not a real number. Now consider the set S = {+1, +i} with binary operation x 
defined according to Table 8.3. Then (S, x, 1,~!) is an abelian group. 


x] 1 -1 i -i 
1) 1-1 Gi =i 


—1}-1 1 -i i (8.3) 

i i -i -l 1. 

-—i | -i i 1 -1 
i 
Example 8.1.18 Similar to Example 8.1.17, define x on Q = {+1, +i, +j, +k} 
according to Table 8.4. The gist of this algebraic structure can be understood 
by noticing that i? = j? =k? =—1, but i, j, and k do not commute with each 


other. If you think of the letters i, j, and k being written on the face of a clock 
at 12, 4, and 8 o’clock, respectively, then multiplication of two different elements 
in the clockwise direction produces the third. Multiplication of two elements in 
the counterclockwise direction produces the negative of the third. For example, 
j xk =iandi x k = —j. These three square roots of —1 are called quaternions, 
and they motivate a group with eight elements. 


x 1 -1 i i J o-i k —k 
1 1 -1 i -i J o-i k —k 
—1|-1 1 -i i -j j w-k k 
i i -i —-1 1 k -k -j J 
-—i| -i i 1 -1l -k J o-J 

8.4 

J] Jj -j —-k k -1 1 i i ey) 
-j | -J J k —-k 1 -1 -i i 
k k —-k j -j -i i -l 1 
—k | -k k -j J i -i 1 -1 

a 


EXERCISE 8.1.19 On the set S = {0, 1, 2, 3, 4, 5}, let o be defined by Table 8.5. 


(a) Explain how you know that o is a binary operation. 
(b) Is the operation commutative? 
(c) What is the identity element? 


(d) Determine the inverse of each element. 
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(8.5) 


AP WNrR OO 
AR WNrR O]O 
WNaROrR] FR 
PwWwhRONN YN 
NRF ON A W] DK 
OMNFRPW AA 
RORPWN UN 


To verify that the binary operation in Exercise 8.1.19 is associative would be 
a formidable task if all you have is the Cayley table. Suffice it to say that o is 
associative, so that S is a non-abelian group. This group is actually motivated by a 
group whose elements are functions. You will see this group in Section 8.4. 

If someone gives you a set with a binary operation and asks you to show that 
it is a group, you must verify properties G1-G5. To give you an idea of how that 
might look, we will now walk through most of the details of a specific example. 

Define the complex numbers by 


C={a+bi:a,be R} (8.6) 


At this point, i is a mere symbol with no meaning. We should think of C merely as 
a set of ordered real number pairs, the first of which stands alone and the second 
of which is tagged with an adjacent i. Furthermore, the + in a + bi is not meant to 
denote real addition, as if the expression a + bi could be simplified. To avoid this 
possible confusion, some authors define elements of C as real number ordered 
pairs (a, b). 

Next we define equality in C, which for clarity we temporarily denote =. We 
define a1 + bi = az + bai, provided the real number equations a1 = az and bi = 
b2 are satisfied. To show that = is an equivalence relation is pretty trivial. For 
example, to show = has property E2, suppose a1 + bi = a2 + boi. Then aj = a2 
and b; = bo. Since real number equality has property E2, we have aj = a; and 
bz = by. Thus a2 + boi = ay, + bi. 

Define the binary operation © in the following way. 


(a+ bi) ® (c+ di) =(at+c)+(b+d)i (8.7) 


We claim that C is an abelian group under @. Here are the steps of the proof in 
meticulous detail. Working through them will give you a good sense of direction in 
the exercise that follows, where you will show that the nonzero complex numbers 
C* with a form of multiplication is a group. 


(G1) Suppose a, + bji=a.+boi and cy +dyi=co+ di. Then a, =a), 
by = bo, cj = Co, and d, = dy. Since real number addition is well defined, 
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aj +c, =a2+c2 and bj + dj = bp + dp. Thus 


(aq, + byt) B (ey + hi) = (4 +1) + (1 +a )i 
= (az + €2) + (b2 + d)i (8.8) 
= (a2 + bai) B (c2 + dai) 


(G2) Pick a+ bi,c + di €C. Since the real numbers are closed under addition, 
a+c,b+deéR,so that (a+ bi) 6 (c+ di) = (a+o+(b4+d@i eC. 


(G3) 


[(a+ bi) ® (c+ di)] @ (e+ fi) =[(a+c) + (b+ di] @ (e+ fi) 
=[(a+c)+e]+[(b+d) + fli 
=[a+(c+e]+lb+d+fli (89) 
=(at+bi) l(c+e)+ d+ fil 
= (a+ bi) @ [(c +di) 6 (e+ fi] 


(G4) Pick a+ bie C. Then (a+ di) @ (04+ 01) = (44+ 0) + (64+ 0)i = a+ bi. 
Similarly, (0 + 01) @ (a+ bi) = (0+ a) + (0+ b)i = a+ bi. Thus 0 + 0: fun- 
ctions as an additive identity. 


(G5) Picka+ bi. Thena, b € R,so that —a, —b € Ralso. Thus (—a) + (—b)i € C. 
Furthermore, (a+ bi) @ [(—a) + (—b)i] = (a—a) + (b— b)i = 04 03, and 
[(—a) + (—b)i] ® (a+ bi) = (-a+a)+(—b+b)i=0+0i, so that C is 
closed under inverses. 


Finally, (C, 6, 0 + 0i, —) is abelian because 


(a+ bi) ®@ (c+ di) = (at+c)+ (b+ d)i 
=(ct+ta)+(d+Db)i (8.10) 
= (c+ di) (a+ bi) 
If we had shown that © is commutative before we showed that properties 


G1-G5 hold, then the proofs of properties G4 and G5 would not have required 
the two-sided demonstrations we provided. 


EXERCISE 8.1.20 Define a form of multiplication ® on C* by 


(a + bi) ® (c + di) = (ac — bd) + (ad + be)i (8.11) 


8.1 Introduction to Groups 251 
Show that C% is an abelian group under @.>-4 


EXERCISE 8.1.21 Prove that the integers with binary operation a*xb= 
a+b-+1 form an abelian group. 


EXERCISE 8.1.22 If we define a binary operation by ax b= a+ b-—ab, then 
there exists a real number xo such that R — {xo} is a group under x. Find x9 and 
show that R — {xo} with « is an abelian group.> 


EXERCISE 8.1.23 Suppose (G, *, e,~!) is a group. 
(a) Show that the identity element is unique. 


(b) Prove the left and right cancellation laws: 

(i) Ifexa=cxb,thena=b. 

(ii) Ifaxc=bxc,thena=b. 
(c) Show that the inverse of a group element is unique. 
(d) Show that (a~!)~! =a for allae G. 


(e) Show that (a*b)~! =b-! xa7t. 


-1 -1 


(f) Show inductively that (a; * az *---*a,)~' =a7! *a peo Ra. 


n n—- 

In Exercises 3.5.4 and 3.5.5, we defined a” for nonzero real numbers a and all 

integers n, then derived rules for exponents. In the context of an arbitrary group G 

with binary operation +, we can now fix any a € G and make the same definitions 
for a", where we write 


a =e (8.12) 
a+! =a" xa forn>0 (8.13) 
ao Slay" (8.14) 


Furthermore, by mimicking exactly the proofs from these exercises done in the 
context of the nonzero real numbers, we arrive at similar exponent rules for 
* on G, except that one rule depends on G being abelian. 


Showing closure of ® involves verifying that the product is never 0 + 0i. Prove contrapositively by 
showing that if (a + bi) ® (c+ di) = 0 + 0i, then either a = b = 0 orc = d =0,s0 that either a + bi or 
c+di is not an element of C*. 

4 if (a+ bi) ® (c+ di) = 0+ 0i, then ac — bd = 0 and ad + bc = 0. Square these and add. 

5 You can’t find xp until you have found the identity. 


252 


8.2 


8.2.1 


6 


Chapter 8 Groups 


Theorem 8.1.24 Let G be a group, a, b € G, and let m and n be integers. Then 


a” xa" =a" (8.15) 
any = qi (8.16) 

Furthermore, if G is abelian, 
(ax b)" =a" xb" (8.17) 


EXERCISE 8.1.25 Suppose (G, *, e,~!) is a group such that a*a=e for all 
a € G. Then G is abelian.® 


EXERCISE 8.1.26 Let (G, *x,e,-!) bea group, and fix some g € G. Define f : 
G => Gby f(x) = g «x for all x € G. Then f is a bijection from G to itself. 


Convenience is desirable if it costs nothing in clarity. For this reason, we often 
refer toa group (G, x, e, |) simply as group G, and we sometimes use juxtaposition 
of terms ab to indicate the binary operation a « b, just as we do with multiplication 
in the real numbers. If we are working with two groups G; and G2, we should 
probably be more careful at first to distinguish between the symbols for the binary 
operations, and we might denote the identity elements as e; and e2, respectively, 
just to be clear. 


Subgroups 


You might have noticed that S from Example 8.1.17 is a subset of Q from 
Example 8.1.18. More than that, the binary operation on S is the same as that 
on Q, in the sense that Table 8.3 is a subtable of Table 8.4. Thus S is a subset of 
Q that is closed under the binary operation, contains the identity, and is closed 
under inverses. We give such a subset a name. 


Subgroups Defined 


Definition 8.2.1 Suppose (G, *, e,~!) is a group, H is a subset of G, and H is 
a group under the same operation. Then (H, e, *,~!) is called a subgroup of G, 
and we write H < G. If H is a proper subset of G, then A is called a proper 
subgroup of G. 


If we are given a subset H of a group (G, *, e, |) and we are asked to show 
that H is a subgroup of G, we must show that A is itself a group under «. Properties 


a* a = ¢ is equivalent to a = a~!. Use Exercise 8.1.23(e). 
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G1 and G3 are automatically satisfied on H, for if « is well defined and associative 
on all of G, then certainly it is well defined and associative when restricted to H. 
We say that H inherits these properties from G. We must therefore demonstrate 
only the remaining properties. 


(H1) The operation is closed on H (G2). 
(H2) The identity element is in H (G4). 
(H3) The inverse of every element of H is also in H; that is, H is closed under 


inverses (GS). 


Example 8.2.2 For any group G, {e} and G itself satisfy properties H1—H3 and 
are therefore subgroups of G. We call {e} the trivial subgroup. 1 


Example 8.2.3 The set of even integers is a subgroup of (Z, +, 0, —) since it is 
closed under addition, contains zero, and is closed under negation. 1 


Example 8.2.3 is a special case of the next result. 


EXERCISE 8.2.4 Letn bean integer and consider S = {kn : k € Z}, the set of all 
integer multiples of n. Show that S is a subgroup of the integers under addition. 
EXERCISE 8.2.5 Find all subgroups of each of the following groups. 

(a) The group in Example 8.1.9 


(b) The quaternion group in Example 8.1.18 
(c) The group with Cayley Table 8.18 


@®|0 1 2 3 4 
0/0 1 2 3 4 
1/1 2 3 4 0 (8.18) 
2/2 3 4 0 1 
3/3 4 0 1 2 
41/4 0 1 2 3 
EXERCISE 8.2.6 From the multiplicative group in Exercise 8.1.20, let 
H={at+bieC:¢a+bh= 1} (8.19) 


Show that H is a subgroup of C*. 


EXERCISE 8.2.7. LetG= {a+ bV2:a,b € Q, aand b not both zero}. Clearly, 
G is a subset of the real numbers. Also, by Exercise 3.10.6, a+ b/2 =c+dV2 
if and only if a=c and b = d. We want to show that G is a subgroup of R*, the 
multiplicative group of nonzero real numbers. 
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(a) Explain why zero is not in G. 


(b) Show that G is a subgroup of (R*, x, 1,~!) by showing it has properties 
H1-H3. 


EXERCISE 8.2.8 Define the center of a group G to be the set of all elements 
that commute with all elements of G. That is, the center is 


{ae G:axx=xx«a forall x € G} (8.20) 


Show that the center of G is a subgroup of G. 


EXERCISE 8.2.9 Suppose {Hy}ve4 is a family of subgroups of a group G. 
Determine with proof whether each of the following is a subgroup of G. 


(a) Maca Ha 
(6) Uses Ha 


Suppose H is a subgroup of G and H C H; CG C Gy. We can make the 
following observations about relationships between these sets. First, if Hj is a 
group under the same binary operation, then the fact that H is a subgroup of G 
implies H is a subgroup of H, also. For if H exhibits properties H1—-H3 as a subset 
of G, it also does so as a subset of Hj. Similarly if G; is a group, then H is a 
subgroup of G;. The most efficient way to describe this latter relationship is to say 
that < is transitive: If H < G and G < Gj, then H < G. 


Generated Subgroups 


From the quaternion group Q in Example 8.1.18, let’s take some subset of Q, 
say A = {i, j}. Although A is a subset of Q, A is clearly not a subgroup of Q. We 
can, however, determine the smallest subgroup of Q that contains all elements 
of A. We now address the existence and possible uniqueness of what we call the 
subgroup generated by A. Let’s begin by defining the term that conceptually means 
the smallest subgroup containing all elements of a given subset of the group. 


Definition 8.2.10 Suppose G is a group and A is a non-empty subset of G. 
Suppose also that H is a subset of G with the following properties. 


(U1) ACH. 
(U2) H is asubgroup of G. 
(U3) If Bis a subgroup of G,and A C B,then A C B. 


Then AH is called a subgroup generated by A, and is denoted (A). 
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Notice how properties U1-U3 lay out the appropriate criteria by which a set 
qualifies as a smallest subgroup that contains all elements of A. Property U1 
guarantees that any set that we would call (A) does in fact contain all elements 
of A, and property U2 guarantees that it is indeed a subgroup of G. Property U3 
guarantees that no other subgroup of G that contains all elements of A will be any 
smaller. The fact that (A) exists uniquely is guaranteed by the next exercise, which 
gives us one way to visualize its construction. 

Let A be a subset of a group G, and consider the family of subgroups of G, 
where each set in the family is a superset of A. That this family is non-empty is 
immediate, for G itself is a subgroup of G and is a superset of A. In the next 
exercise, you will show that the subgroup generated by A exists uniquely. What 
you will show is that the intersection of all subgroups of G that are supersets of 
A has properties U1—-U3, and that any two subsets of G that both have properties 
U1-U3 are actually the same. 


EXERCISE 8.2.11 LetGbea group, and suppose A is a non-empty subset of G. 
Then (A) exists uniquely. In fact, 


(A=) J (8.21) 


J<G 
JDA 


Equation (8.21) might not be the best way to construct (A). It might be easier 
to begin with the elements of A and build up (A) by tossing in only the necessary 
elements of G until you’re sure you're finished. 


Example 8.2.12 From Example 8.1.18, and A = {i, j}, determine (A). 


Solution Clearly, (A) must contain 1, and by property H1, it must contain 
i? =—1 and ix j=k. Appealing to H3, we must have —i, —j, —k € (A) also. 
Thus (A)=Q. 


EXERCISE 8.2.13 For the group from Example 8.1.9, determine the subgroup 
generated by the following sets. 


(a) {2,4} 
(b) {2,3} 
(c) {5} 


Cyclic Subgroups 


The subgroup generated by a singleton set {a} is usually denoted (a) instead of 
({a}). In this case, we call (a) the subgroup of G generated by the element a, anda 
is called a generator of this subgroup. Any subgroup that has a generator is called 
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a cyclic subgroup. The easiest way to see what (a) looks like is to build it from the 
bottom up, so to speak, and then demonstrate that what you have created satisfies 
U1-U3. Using the definition of a” in Eqs. 8.12-8.14, consider the set 


S={a":neZ} (8.22) 


The claim is that S = (a), which we verify here by showing that S has proper- 
ties U1-U3, thereby earning the right to be called (a). The details will help you in 
Exercise 8.2.24. 

Let n =1 to see that ae S, so that S has property Ul. To show S has 
property U2, we must show it has properties H1—H3. 


H1) Pick x, ye S. Then there exist integers m and n such that x =a” and 
y g 
y =a". Now m+n is also an integer, so a” xa" =a"*" € S. Thus S is 
closed under ». 


(H2) Letting n = 0, we have thate =a € S. 


(H3) Pick a” € S. Since Theorem 8.1.24 applies for all integers n, we have that 
(a")-! =a € S. Thus S is closed under inverses. 


To show that S has property U3, suppose B is a subgroup and a € B. We show 
S C B by showing a” ¢€ B for all n. Since a € S and B is closed under *, it must be 
that a” € B for all positive integers n. Certainly, a° = e € B, and since B is closed 
under inverses, a” € B for all positive integers n. Thus S C B, and we have 
finished the proof that (a) = {a" :n € Z}. 

Example 8.2.14 In the multiplicative group of nonzero real numbers, 

(3) = {(3" :n € Z} = {...,1/27, 1/9, 1/3, 1,3,9,1/27,...} i 
Example 8.2.15 We construct (3) in the additive group (Z, +,0, —). Note that 


zero is the identity in this context, so that 3°? = 0. Constructing 3” for n > 2 is 
repeated addition, not repeated multiplication. Thus 


31=3, 37=343=6, 3=643=9, 34=94+3=12, etc. (8.23) 
Constructing 3~” involves negation, not reciprocation, so that 
3-1 =-3 
3-7 = 3-1) = [(-3) + (-3)] = -6 (8.24) 
3° = 3) =[(-6) + (-3)] = —9, ete. 


Thus (3) = {(3k:keZ}. 
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Example 8.2.15 suggests it might be worthwhile to rewrite the exponent 
definitions in Eqs. (8.12)-(8.14) and the exponent rules in Eqs. (8.15)-(8.17) in 
what we call their additive form. In particular, if (G, +, 0, —) isa group anda € G, 
the fact that the binary operation is a form of addition suggests that, instead of 
writing a? = e, which seems to connote multiplication, we instead write 


0a =0 (8.25) 


We must be careful, though, for the zero on the left-hand side of Eq. (8.25) is an 
integer, whereas the zero on the right is the identity element of the group. Saying 
Oa = 0 means that the group element ais, in a sense, added to itself zero times, not 
multiplied by itself, and this is to be defined as the zero (identity) element of the 
group. Similarly, the additive form of Eq. (8.15) would be 


ma+na=(m-+n)a (8.26) 


EXERCISE 8.2.16 Let (G,+,0,—) be an additive group. Write Eqs. (8.12)- 
(8.17) in their additive form. 


EXERCISE 8.2.17 Determine the following cyclic subgroups. 
(a) (5) in (R*, x,1,~1) 

(b) (5) in (Z, +, 0, —) 

(c) (5) in the group from Example 8.1.9 

(d) (2) in the group from Example 8.1.9 

(e) (3) in the group from Example 8.1.9 

(f) (0) in the group from Example 8.1.9 


(g) (a), where a is any element other than the identity in the group with Cayley 
Table 8.18 


(h) (j) in the quaternion group from Example 8.1.18 
(i) (2i) in (C*, x, 1,7!) 

: 1 La x -1 

(j) (At wD in (C calls ) 

As Exercise 8.2.17 illustrates, even though a group is infinite, the subgroup 
generated by an element might or might not be infinite. In Example 8.2.14, 
the subgroup generated by 3 is an infinite cyclic subgroup. But let’s consider 
the subgroup generated by i in (C*, {0+ Oi}, x, 1+ 0i,~!). First, i = 1, i! =i, 
i? = —landi? = —i. But i+ = 1, and 4 is the smallest positive power of i for which 
this is true. Furthermore, for any integer n, the division algorithm allows us to write 
n = 4k-+r where 0 <r <3, so that i” = i***+” = (i4)k.i” = i”. Thus every power 
of i can be simplified to an element of { i? i!, i2, 7}, and we have shown that 


@={(":neZ}= (i, i, 7,3} = (1,7, -1,-3 (8.27) 


so that (i) is a finite cyclic subgroup in an infinite group. 
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This suggests a general result for some cyclic subgroups. Suppose G is a group 
and a € G. Suppose there exists some integer k such that a = e. Since a~* would 
also be the identity, we may assume k > 1. Of all positive values of & for which 


a‘ = e, let n be the smallest and consider the set 


T ={a’,a',a*,...,a°}} = {a :0<k<n-V (8.28) 


Since n is the smallest positive integer for which a” = e, no a‘ is the identity for any 
1 <k <n-—1. Thus e appears exactly one time in Eq. (8.28) as a’. Furthermore, 
all elements of T are distinct. 


EXERCISE 8.2.18 Show that the elements of T in Eq. (8.28) are distinct by 
showing that if 0 < k <1 <n—1,then a‘ ¥ d’ (or the contrapositive). 


Clearly, {a« :0<k <n—1} ¢ {a* :k € Z},s0 that T C (a). 
EXERCISE 8.2.19 Show 7 2 (a) to have T = (a). 
Thus we have shown the following. 


Theorem 8.2.20 Let G be a group and let a € G. Suppose a” = e for some 
positive integer n, and suppose n is the smallest such positive integer. Then 


GQ Sleaie, aaa) (8.29) 


If n is the smallest positive integer for which a” = e, we say that the element a 
has order n, and we denote the order of a by o(a). Ifa" # e for all positive integers 
n, we say a has infinite order. In Definition 8.1.13, we defined the order of a group 
as its cardinality. Here we are defining the order of an element of a group in terms 
of its powers. By constructing (a) as we have done here, we have demonstrated 
that these two uses of the term order are tied together. 


Theorem 8.2.21 If G is a group and ae G has order n, then the subgroup 
generated by a has order n. Conversely, if (a) has n elements, then a has order n. 


With Theorem 8.2.21, we can use o(a) to mean either the order of the element a 
or the order of the subgroup generated by a. Naturally, if G is finite and a € G, the 
subgroup generated by a will be finite. 


Example 8.2.22 In the quaternion group from Example 8.1.18, 
(k) = {k" :0 <n < 3} = {1,k, -1, —k} 


so that o(k) = 4. I 


7 Pick a € (a) and apply the division algorithm to k and n. 
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EXERCISE 8.2.23 IfGisa group and a € G, then (a) is abelian. 


EXERCISE 8.2.24 Suppose G is an abelian group, and a, b € G. Then 
({a, b}) = {a b” : m,n € Z} (8.30) 


EXERCISE 8.2.25 Write Eq. (8.30) in its additive form. 


EXERCISE 8.2.26 Consider the group (Z, +, 0, —), and let a and b be nonzero 
integers and g = gcd(a, b). Then ({a, b}) = (g). 


EXERCISE 8.2.27 Suppose G is a group and ae G. Let m and n be positive 
integers, and suppose gcd(m,n) = g. Then ({a”, a"}) = (a8). 


EXERCISE 8.2.28 For the additive group of integers, determine the following 
without proof. 


(a) (4) 6) 
(b) (10) (3) 
(c) (8)N (16) 
(d) (12) 1 (16) N (28) 
(e) ({4, 6}) 
(f) (4,7) 
(g) ({12, 16, 28}) 
If G is a group and there exists some element g such that (g) = G, then G is 
said to be a cyclic group and g is called a generator of the group. There are some 


nifty little theorems about cyclic groups. For example, by Exercise 8.2.23, a cyclic 
group is abelian. Here is another. 


EXERCISE 8.2.29 A cyclic group is countable. 

EXERCISE 8.2.30 Determine with explanation whether each of the following 
groups is cyclic. 

(a) (Z,+,0,—) 

(b) The group from Example 8.1.9 

(co). TR) 

(d) The quaternion group from Example 8.1.18 

(ec) (Q,+,0,-) 
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In this section, we return to the group in Example 8.1.9 to derive it in a formal 
way from the integers. The result of this construction is called the integers 
modulo n (n = 6 in Example 8.1.9). This construction serves as a good illustra- 
tion of a quotient group, which we will discuss as a generalization of the process 
of deriving the integers modulo n. 


Integers Modulo n 


In Example 8.1.9, we noted that (S, @, 0, —) is a group, where S = {0, 1, 2, 3, 4, 5} 
and @ was defined in Table 8.1. You might have noticed the similarity of @ to 
regular addition of integers, except that it was a sort of circular addition. Any sum 
such as 4+ 4 that exceeds 5 is reduced by 6 to guarantee that the sum is in S. 
This circular summing is sometimes called clock arithmetic, where in this case, 
the numbers {0, 1, 2,3, 4,5} can be written around the perimeter of a clock and 
addition can be performed in a natural circular way. This example of a group might 
sound a bit simplistic but actually, in spite of its simplicity, is a most important 
example in group theory. Perhaps we should say because of its simplicity, its 
importance in group theory is particularly striking and beautiful. 

Here is a standard way of constructing this group by beginning with the inte- 
gers. It takes a few steps and might seem a bit esoteric, but it is representative of a 
more general procedure. Study it carefully, because some of the details are left to 
you, and you will mimic the details as you prove the corresponding results of the 
more general procedure in an exercise to follow. Instead of considering the special 
case of S = {0,1, 2,3, 4, 5}, we consider a more general case {0,1,2,...,n — 1}. 


Step 1: Construct the set. Start with the group (Z, +, 0, —), and let n be a given 
positive integer. Let 


(n) = {kn:k € Z} ={..., —3n, —2n, —n, 0,n, 2n, 3n, ...} (8.31) 


be the subgroup of the integers generated by n. Define an equivalence 
relation on the integers as follows. Define 


x=, y ifandonlyif x—ye(n) (8.32) 


That is, x =, y provided x — y = kn for some integer k. This is the same 
definition of equivalence that we used in Exercise 3.6.18, so proving =, 
is an equivalence relation on the integers has already been done. Recall 
from Exercise 3.6.23 that x =, y if and only if x and y have the same 
remainder when divided by n according to the division algorithm. Also, 
recall the equivalence classes that arise from this definition of equivalence: 


Step 2: 


Step 3: 
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{..., —2n, —n,0,n,2n,...} 
[A] ={...,-2n+1,-n+1,1,n4+1,2n+1,...} 
{..., -2n+2,-n+2,2,n+2,2n+2,...} 


(8.33) 
[K]={...,-2n+k,—n +k kn tk ntk,..} 


[n-—1] ={..., -n—-—1,-1,n—1,2n—1,3n—-1,...} 


Notice the very important fact that [0] = (7); that is, the equivalence class 
of the identity in (Z, +, 0, —) is the subgroup from which the equivalence 
is defined. Lump these n equivalence classes into a family and callit Z/(n), 
the integers modulo n. 


Z/(n) = {[0], (1], [2], ...,[~ — 1} (8.34) 


Recall that every equivalence class has infinitely many names, depend- 
ing on the representative element by which we choose to address it. For 
example, in Z/(6), [5] = [—1] = [41]. 


Define a binary operation on the created set. On the elements of Z/(n), 
which are themselves sets, we define a form of addition @,, in the following 
way: 


[a] @, [b] = [a +z b] (8.35) 


where +z, denotes the sum of the representative integers a and b. For 
example, [4] @¢ [5] = [4 + 5] = [9] = [3], where we choose to refer to the 
sum as [3] since [3] is the same set as [9] and 3 is a representative element 
between 0 and 5. 

Notice that this new binary operation ®, is a way of combining two sets 
to produce another set. It is not union or intersection, which up to now 
are the only binary operations on sets we have seen. Instead, @, com- 
bines [a] and [b] by using representative elements from each to produce 
a representative element of the set we are defining to be [a] @, [b]. 


Show that the created set with its binary operation gives rise to a group. 
We must show that @, is well defined, closed, and associative. We must 
also find an identity element and find inverses for all elements. Clearly, ®, 
is closed. For if [a], [b] € Z/(n), then a + bis an integer. Since the equiva- 
lence classes in Z/(n) partition the integers, a + b is in some equivalence 
class. Thus [a + b] € Z/(n). 
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EXERCISE 8.3.1 Finish the proof that Z/(n) with operation @, is a group with 
the following steps. 

(a) Show @,, is well defined. 

(b) Show @» is associative. 

(c) Determine the identity element. 


(d) For an arbitrarily chosen [k], determine the inverse of [k]. 


And we are done. Table 8.36 displays the final product for the case Z/(6). 


@n | 10) [1] [2] B) [4] [5] 
[0] | (0) (1) [2] [B)] [4 [5] 
1); (2) ) [4] [5] [0] 
[2] | [2] [3] [4] [5] [0] [1 (8.36) 
[3] |} 13) [4] [5] [0] (1) [2] 
[4] | [4] [5] [0] [1] [2] [3] 
[5] | [5] [0] [1] [2] [3] [4] 
Let’s take a closer look at the equivalence classes [0], [1],..., [a — 1], for 


there is another standard notation for these sets that we will use below when we 
derive quotient groups in general. Though we know from Eq. (8.33) what is in 
each equivalence class, let’s consider a way to visualize an arbitrary equivalence 
class [k] in terms of the subgroup of the integers (n) = [0] that motivated this 
whole mess in the first place. 

Picture all the integers on the number line in standard fashion, but pretend 
that each integer is like a key on a piano keyboard. Take countably infinitely 
many of your friends, and each of you place a finger on the elements of (n) = [0], 
so that you are pointing to all the multiples of n: {..., —2n, —n,0,n, 2n, ...}. Now 
suppose you want to point to all the elements of some equivalence class [k]. How 
can you do it? Everyone in unison should lift his or her finger off the keyboard, 
and everyone should shuffle over to the right & units, then put his or her finger back 
down. In other words, to generate all the elements of [k], take all the elements of 
(n) and add k to each one, so that you are translating the entire set (n) through 
the integers by k units. Notice in this imagery, there will always be some person 
pointing to one of the integers {0, 1, 2,..., — 1}. Here is another way to describe 
what we have done. 


[k] ={x+k:xe€(n)} (8.37) 
Let’s create new notation for the construction in Eq. (8.37), writing 


(nytk={x+k:xe€(n)} (8.38) 


This is a slight abuse of orthodox notation because it appears that we are com- 
bining the subgroup (n) with an integer k using integer addition. This notation 
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is standard, however, and it makes the definition of addition in Eq. (8.35) look 
like this: 


((n) + a) ®n ((n) +b) = (n) + (a +5) (8.39) 


Take note of the multiple uses of parentheses in Eq. (8.39). Some are used to 
denote a generated subgroup, and others are used merely to group symbols. With 
this notation for the equivalence classes in Z/(n), the Cayley table, written in its 
painfully rigorous form, would look like Table 8.40 for the case n = 6. Once again, 
notice that every coset can be addressed by a unique integer 0, ..., 5. 


On| (2)+0 (+1 M+2 (+3 (+4 (+5 
(ny+O0} (xn) +0 (+1 @W+2 (43 (+4 (+5 
(yyt+1}| +1 @+2 @+3 (M+4 (+5 (nr) +0 
(n)t+2}] (+2 (W)4+3 (+4 (+5 (2)+0 (+i (8.40) 
(ny+3} XF)4+3 (+4 (+5 (+0 (+1 (+2 
(nyh+4}m)+4 (+5 (@+0 (@mst+1 @m+2 (+3 
(ny+5}| (X)+5 (+0 (+1 @m+4+2 (m+3 (+4 


Now let’s relax the notation for the specific context of Z/(n). First, Z/(n) is 
usually written Z,. Also, instead of always writing (n) +k, we allow ourselves 
simply to write k, understanding that we are not talking about a single integer but 
the entire equivalence class of integers of which k is a representative element. We 
also generally revert to the regular addition sign + rather than writing ®@,. With 
this simplified notation, Table 8.40 becomes Table 8.1. 

This all boils down to the fact that mathematicians get a little lazy and let the 
familiar notation from (Z, +, 0, —) also serve for the group Z, with addition, so 
that (Z,, +, 0, —) isa relaxed notation for (Z/(n), @n, (n) + 0, —). This is probably 
all right because we are less interested in fancy notation than we are in the internal 
structure of Z,, as a group. And it is just as helpful to visualize Z,, as clock arithmetic 
with regular old numbers as it is to think in terms of combining equivalence classes 
of integers. Understood this way, we can then write something like 17 + 9 =¢ 2 
and know just what we mean. 


Quotient Groups 


The derivation of the integers modulo n is affected by the fact that integer addition 
is commutative. Did you notice at what point in Exercise 8.3.1 you exploited 
this fact?® Now let’s generalize the program for deriving Z, to create a quotient 
group in a more abstract setting. Before we get started, however, we must explain 
a somewhat restricted approach we will take. At first, we are going to restrict 
ourselves to an abelian group as a starting point, which we should not have to do. 


8 Somewhere in showing @n is well defined. 
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We will allude to the reason for this at the end of the discussion, but for now, do 
not use the fact that the group is abelian anywhere it is not absolutely necessary, 
and take note of the step(s) in the proof where you do use it. The proofs of all 
the steps along the way in this derivation are left to you as exercises. As you read 
them, refer to their parallel steps in Z,,, and note how the new notation here is a 
generalization of the Z/(n) notation to an arbitrary group. 


Step 1: Given anabelian group (G, *, e,-!) and asubgroup H, we define a relation 
= on G in the following way. Define 


a=ybeaxb!eH (8.41) 


By the next exercise, = is an equivalence relation, so that G is partitioned 
into equivalence classes. We denote the set of all equivalence classes G/H, 
which is read “G mod H.” 


EXERCISE 8.3.2 Show that =, is an equivalence relation. 


Before we continue the construction of the group G/H by defining its binary 
operation, let’s see what the equivalence classes generated by the equivalence 
definition in (8.41) look like. In Z, we have a pretty clear picture of what is in 
each equivalence class, as illustrated by Eq. (8.33). In a general group, however, 
we want a way to visualize the elements of [g] for a given g € G. In the next 
exercise, you will show that x € [g] if and only if there exists some A € H such that 
x=hxg. 


EXERCISE 8.3.3 Show that [g] = {hx g:he H}. 


Exercise 8.3.3 says that we can think of the elements of [g] as the result of 
taking all elements of H and using the binary operation” to scoot them all over g 
“amount.” The set [g] is called the right coset of H generated by g, and instead of 
being written in equivalence class notation is denoted 


Hxg={hxg:he H} (8.42) 


A coset is strikingly similar to the construction in (8.37) and can be visualized in a 
similar way as a translation of all elements of H through G by g units, so to speak. 
We then write G/H = {H « g: g € G}, the family of all cosets of H that can be 
created by letting g take on all values in G. 

We are now ready to continue the construction of the quotient group by 
defining a binary operation on G/H. 


9 With g on the right of each h € H, which will matter later. 
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Step 2: Define an operation «4 on G/H in the following way. 
(H * a) *H (H * b) = H x (ax b) (8.43) 


Note the distinction between the uses of « in this definition. We are using 
* to denote the operation we are defining on the family of cosets of H. 
The « in H *a is the standard notation for the coset generated by a, 
comparable to (n) + k in Eq. (8.38). Finally, the *« in (a * b) is the binary 
operation on G. 


Step 3: Show that «7 is a well-defined, closed, associative binary operation on 
G/H, find the identity element, and find inverses for elements. 


EXERCISE 8.3.4 ShowthatG / H with operation * is a group with the following 
steps. 


G1: Show «, is well defined on G/H. 

G2: Show «y is closed on G/H. 

G3: Show «jy is associative on G/H. 

G4: Show H * eis the identity element of G/H. 


G5: Show that every H «a has an inverse in G/H. 


Unless you are unnecessarily sloppy with your algebraic manipulation in com- 
pleting steps 1-3, there is only one place you must exploit the fact that G is abelian, 
and it is the same place where you exploited commutativity of integer addition 
in your work with the integers modulo n. In Section 8.5, we will return to this 
construction in the context of an arbitrary group that might not be abelian. But 
even then, if the program is going to work, we cannot completely do without some 
way to switch the order of certain elements. In Section 8.5, we will guarantee the 
property we need by requiring H to be a special kind of subgroup called a normal 
subgroup. The relationship of a normal subgroup to the entire group will look a 
bit like commutativity, but weaker, so that the construction of the quotient group 
can be realized in as general a group as possible. 

The set of all cosets with its binary operation forms the new group, which we 
call the quotient group generated by H. Here is the definition all in one piece. 


Definition 8.3.5 Suppose (G, x, e,-!) is a group (abelian), and let H be a sub- 
group of G. Define an equivalence relation =, by declaring a =y b provided 
a*b~! € H.Then 


G/H={Hxg:geG} (8.44) 


with binary operation « defined by 


(H * a) *q (H *« b) = H x (a b) (8.45) 
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identity element H*«e=H, and inverses (H * a)!'=Hxa! is called the 
quotient group of G created by modding out the subgroup H. Elements of G/H 
are called right cosets of H. 


For equivalence classes in general, we know that [a] = [b] if and only ifa and b 
are equivalent; otherwise [a] and [b] are disjoint. In the context of Definition 8.3.5, 
these facts become H * a = H * b if and only if a* b~! € H; otherwise H * a and 
H x bare disjoint. In other words, a and b generate the same coset of H if and only 
if ax b~! € H. Otherwise the cosets they generate are disjoint. 


Example 8.3.6 Let G be the set of all functions f : R — R. For two functions f 
and g, we define their sum f + g by the rule [f + g](x) = f(x) + g(x). Clearly, the 
function 0 defined by 0(x) = 0 for all x is the identity element, and — f is the func- 
tion defined by [— f](x) = — f(x). Thus (G, +, 0, —) is an abelian group. Let H be 
the subset of G consisting of all constant functions, which is clearly a subgroup of G. 

What do the elements of the quotient group G/H look like? If f is a given 
function in G, then H+ f = {h+ f:h € H} is the set of all translations of f up 
and down in the xy-plane by the constant functions in H. That is,g €¢ H+ f if and 
only if there exists some constant function / such that g =h+ f. Equivalently, 
g€H+- f, provided g — f € H, which means that g — f is a constant function. 
This might remind you of a fact from calculus. If f is any antiderivative of a func- 
tion f,, then H + f is the set of all the antiderivatives of f;. If you like, every coset 
can be addressed by choosing a representative element, perhaps the function in 
the coset that passes through the origin. 


EXERCISE 8.3.7 Since the real numbers under addition form an abelian group 
with the integers as a subgroup, we may discuss R/Z. 


(a) In constructing R/Z, what is the definition of equivalence that gives rise to 
the right cosets of Z? 
(b) Construct the following cosets by listing some of their elements. 


(i) Z+1.4 
(ii) Z+ V2 
(iii) Z+5 
(c) What is a convenient subset of the real numbers from which we may choose 
a unique representative element of each coset?!° 


(d) Evaluate the following in R/Z. Express your answer as a coset Z + r, where 
r is an element of your answer to part (c). 


(i) Z+2.2) +z (Z+ 8.14) 
(ii) (Z+3.8) +z (Z+0) 


10 There are infinitely many answers, but one of them is considered standard. 
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(iii) (Z+ 3.8) +z (Z+1.2) 
(iv) —(Z + 11.23) 
(v) (Z+3.8) —z (Z+1.2) 


(e) Describe addition in R/Z with an analogy similar to the clock arithmetic used 
to describe addition in the integers modulo n. 


Cosets and Lagrange’s Theorem 


Even if G is not an abelian group and the family of cosets of a subgroup of G do 
not give rise to a quotient group, we can still derive some important results about 
elements and subgroups of G by looking at the cosets of the subgroup. One thing 
to keep in mind if G is not abelian is that we have to distinguish between left and 
right cosets. For a subgroup H and some fixed g € G, we use the notation 


gxH = gH ={gxh:he H} (8.46) 
Hxg=Hg={hxg:he H} (8.47) 
to denote the left and right coset generated by g, respectively. If G is abelian, then 


a particular g will generate the same left and right coset. However, if G is not 
abelian, this might not be the case. 


EXERCISE 8.3.8 Findallleft and right cosets of H = {+1 
group. For each possible g € Q,is gH = Hg? 


i} in the quaternion 


EXERCISE 8.3.9 From the group in Exercise 8.1.19, let H = {0, 1}, which is a 
subgroup. Evaluate 2 0 H and H 02. 


What makes left and right cosets the same for a particular group element is a 
question for Section 8.5. Because the results we want to derive here are the same 
for either left or right cosets, we will look only at right cosets. First, all cosets of a 
given subgroup have the same cardinality. 


EXERCISE 8.3.10 IfGisa group and H is a subgroup of G, then | H| = | Hg| for 
allgeG. 


Regardless of whether G is of finite or infinite order, the number of cosets of H 
in G might be finite. If so, we call the number of cosets of H the index of H in G, 
and we denote this number by (G : H). Naturally, if G is finite, then so is (G : H), 
and the following theorem is immediate as an implication of Exercise 8.3.10. 


Theorem 8.3.11 (Lagrange). Ina finite group, the order of a subgroup divides 
the order of the group. 
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Proof. Let G be a group, and let H be a subgroup of G. Since all cosets of H are 
disjoint and have the same cardinality as H, we have that 


IG] =(G: H) x |A| 


If x is an element of a finite group G, then Lagrange’s Theorem says that the 
order of the subgroup generated by x divides the order of G. Since the cardinality 
of (x) is the same as the order of the element x (Theorem 8.2.21), we have that 
o(x) | o(G). Beginning here, it is possible to prove several results about elements 
of a group. 


EXERCISE 8.3.12 Suppose o(G) =n, and g € G. Then g” = el 


EXERCISE 8.3.13 Suppose G is cyclic of order n and k is a positive integer. 
If k | n, then there exists a subgroup of G of order k. 


EXERCISE 8.3.14 A group of prime order is cyclic.!* 


Permutation Groups 


In this section, we take an in-depth look at a particular group and two particularly 
important subgroups that derive from it. Our main purpose is to become familiar 
with an especially important non-abelian group. Then in Section 8.5, we will use 
this group as motivation for the definition of a normal subgroup. Requiring a 
subgroup to be normal is just the right thing to patch up the hole we left in our 
derivation of quotient groups by requiring that the group be abelian. 


Permutation Groups Defined 


Let A be a non-empty set, and let S be the set of all bijections from A to itself. We 
want to take a close look at the group formed on S with the operation of composi- 
tion. First, note that composition is well defined and closed on S by Exercise 8.1.4, 
so that S with the operation of composition has properties G1 and G2. 

The fact that composition is associative (G3) is a mere exercise in the 
manipulation of parentheses. For if we pick any a € A, then 


[(fog)oh}(a) = (f og)[h(a)] = f(g(h(@))) = fl(g oh)@] = [fo (g 0 A) (a) 
(8.48) 


Thus [(fog)oh](a) = [fo(goh)](a@) for all ae A, so that (fog)oh= 
fo(goh), and o is associative on S. Writing i: A > A, the identity function, 


1) Apply Theorem 8.2.21 and Lagrange’s Theorem to (g). 
A Any element except e is a generator. 


8.4.2 
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then for a given f € Sandanyaée A, 
(f oi)(a) = fli(@] = fla) and (io f(a) =i f@] = fla (8.49) 


Thusio f= foi= f for all f € S and i is the identity element (G4). Further- 
more, by Theorem 4.4.7, every f € S has an inverse f~! € S. Since 


fif'@l=a=ila and f-'[f@]=a= ila) (8.50) 


for all a € A, we have fo f-! = f-!o f =i (G5). Thus (S, 0, i,~!) is a group, 
called the permutation group on A, and elements of S are called permutations 
of A. 


The Symmetric Group 


Let N, = {1,2,3,..., 2}. The permutation group on N, is denoted S, and is called 
the symmetric group on n elements. To be concrete, we will work here with S¢. 
One way to visualize what a particular f € Sg does is to imagine six fixed slots, 
numbered 1-6, with some sort of object in each position. If f(2) = 5, this means 
that the object in slot 2 is moved to slot 5. Thus every element of S¢ does a sort 
of fruitbasket turnover of the ordered numbers (1, 2, 3, 4, 5, 6). Several notations 
can describe this effect. One is to display the image of every element of N,, like 


this: 
ey ee eee es 
f=\,42465 (S.2)) 


which means f(1) = 3, f(2) = 1, f(3) = 2, f(4 =4, fS) = 6 and f(6) = 5. When 
we compose two permutations in Sg, we work right to left as usual. For example, 


if we write 
123 45 6 ike 
PON Th MAG. ey) 
we may calculate f o g element by element. To find (f © g)(1), we see that 1 is first 


mapped to 3 by g, and then 3 is mapped to 2 by f. So (f 0 g)(1) = 2. Doing the 
same for the remaining elements yields 


123 4 5 6 123 4 5 6 
fog= fe) 
3 246 5 3 A 426) 2." 3 


(3 


ay 


WO N 
A 
nN 
_ 
AN OW 
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EXERCISE 8.4.1. Calculate go f to show that (S¢, 0, i,~! ) is not abelian. 


By flipping the expression for f in Eq. (8.51) upside down and reordering the 
columns, we see that 


pi_(i2345 6° _ (123456 en 
“ “\3 124 6 5) \2 31465 j 


Another way to describe an element of Sg more succinctly is with cycle notation, 
where f = (132)(56) means 1 is mapped to 3, 3 is mapped to 2, and 2 is mapped 
to 1 for one cycle. The other cycle indicates 5 is mapped to 6 and 6 is mapped to 5. 
The absence of 4 means it is mapped to itself. Since (132) and (56) are disjoint, it 
doesn’t matter which you write first. Nor does it matter whether you think of f 
as a single permutation or as the composition of (132) and (56). If you want, you 
can scroll the numbers in a cycle around to make it start with any element in the 
cycle. So (132) = (213) = (321). The identity mapping in S,, is generally written (1). 
If you compose several permutations in cycle notation that are not disjoint, you 
can simplify the expression into disjoint cycles. 


Example 8.4.2 Simplify f = (132)(14)(24)(23)(563). 


Solution Find f(1) first by tracing 1 through the cycles. Since composition of 
functions is performed from right to left, we see that 1 is first mapped to 4 by (14) 
and then 4 is not mapped elsewhere by (132). Thus f(1) = 4. Now find (4) by 
noting that 4 is mapped to 2 by (24), then 2 is mapped to 1 by (132). Thus f(4) = 1, 
and we have completed one cycle (14). Continuing with 2, we see that 2 is mapped 
to 3 by (23), then 3 is mapped to 2 by (132). Thus f(2) = 2, and we can omit it 
in the cycle notation. Continuing with 3, we see f(3) = 5, f(5) = 6, and f(6) = 3. 
Thus the simplified expression is f = (14)(356). 


EXERCISE 8.4.3 Simplify (253) (12) (45) (36) (36) into disjoint cycles. 


How many elements are there in Sg? What about in S, for any n? By Exer- 
cise 4.8.9, there are P(n, n) = n! bijections from an n-element set to itself. Another 
way to conceptualize the question is to ask how many ways there are to arrange 
the elements of N,, in the bottom row of Eq. (8.51). There are n possible entries 
for the first slot, then n — 1 remaining possibilities for the second slot, and so on. 
Multiplying these, we see |S,,| = n!. Thus S, is a group on n! elements, and if n > 3, 
S, is not abelian. 


Example 8.4.4 In S¢, determine the subgroup generated by f = (132)(56). 


Solution Since S¢ is finite, f has finite order. By Theorem 8.2.20, the subgroup 
generated by f contains all powers of f. 


8.4.3 
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f? = fo f = (132)(56) o (132)(56) = (123) 
f? = f? 0 f = (123) 0 (132)(56) = (56) 
fi = fro f = (56) o (132)(56) = (132) (8.55) 
f? = fio f = (132) 0 (132)(56) = (123)(56) 
f° = fo f = (123)(56) 0 (132)(56) = (1) 
Thus 


QHOF LLP 


(8.56) 
= {(1), (132)(56), (123), (56), (132), (123)(56)} 


| 
EXERCISE 8.4.5 In S6, determine the subgroup generated by (125) (346). 


EXERCISE 8.4.6 For g = (12) and h = (34) in S4, determine ({g,h}), the 
subgroup of S4 generated by g and h. 


EXERCISE 8.4.7. In S4, let f = (123) and g= (14). Determine whether g 
generates the same left and right cosets of the subgroup generated by f. 


The Alternating Group 


Now let’s look at an important subgroup of S,. A permutation of the form (ij) is 
called a transposition because all it does is switch the positions of two elements. 
Notice (ij)~! = (ij) and (ij) = (ji). If i= 1 (or j = 1, it doesn’t matter), then (ij) 
becomes (1/). If neither i nor j is 1, then 


Gj) = GNA /)di) (8.57) 


By Eq. (8.57), every transposition that does not involve 1 can be written as a 
product of three transpositions, each of which involves 1. So if you’re playing 
some game where objects are lined up in positions 1, ..., and you want to swap 
the positions of the objects in positions i and j, it is possible to do it even if you 
restrict yourself to swapping an object’s position only with the object in position 1. 
It just takes three moves instead of one. But notice that both (ij) and its equivalent 
in (8.57) use an odd number of transpositions. 

Even though (ijk) is not a transposition, it can be written as a composition of 
two transpositions. One way to do it is 


(ijk) = (ik)(ij) (8.58) 


If none of {i, j, k} is 1, what does (8.58) become if we require that all transpositions 
involve 1, as in (8.57)? Taking a hint from Eq. (8.57) and applying it to both (ik) 
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and (ij), we can write 
(ijk) = ik) G/) = ADNAN AD)dDA/)dA (8.59) 


Since (1i) is its own inverse, the two transpositions in the middle of Eq. (8.59) 
cancel to yield 


(ijk) = dNdA) AA) di) (8.60) 


Now (ijk) = (jki) = (kij). So if one of {i, j, k} is 1, we may assume i = 1, and 
observe that (1 jk) = (1k)(1J). But notice this one fact. Every form of (ijk) that we 
have written here involves an even number of transpositions. 

Suppose we now consider the cycle o = (x1, x2,...,%m), Where no x, = 1. Can 
you take a hint from Eq. (8.60) and jump right to a similar form for o that involves 
only transpositions of the form (1x,)? How about 


o = (1x1) (14m) 1%m—1) A %m—2) > ++ (x2) (141) (8.61) 


If some x, = 1, rotate the entries of o so that x; = 1. Then Eq. (8.61) works 
by deleting the transpositions on each end. Be assured there are many other 
answers. One thing we would like to know is whether all the ways of writing o 
with transpositions have something in common. 

From all this work, we can see that any element of S,,, once written in cycle 
notation with disjoint cycles, can be broken down into a composition of trans- 
positions in at least some way. The identity can be thought of as requiring zero 
transpositions, or if you prefer, we can write (1) = (12)(12) for n > 2. One char- 
acteristic of elements of S, that we want to point out, but not prove in this text, 
is that of all possible decompositions of a particular permutation into transposi- 
tions, either they will all involve an even number of transpositions or they will all 
involve an odd number. This is not a trivial fact to demonstrate. So we will just 
accept it here and leave the proof to your later coursework in algebra. Thus the 
n! elements of S, are partitioned into two classes. If f always decomposes into an 
even number of transpositions, then f is called an even permutation. Similarly, if 
f always decomposes into an odd number of transpositions, then f is called an 
odd permutation. 

Now let’s create an important subset of S,. Let 


An ={f €S,: f is an even permutation} (8.62) 
What do you think the relationship between A, and S,, is? 
EXERCISE 8.4.8 A, isa subgroup of S). 
We call A, the alternating group on n elements. 


Given that |S,,| =7!, how many elements do you think A, has? Your first 
thought might be that, since every element of S, is either even or odd, it would be 


8.4 Permutation Groups 273 


Figure 8.1 Rigid square used to generate the dihedral group. 


only natural that half would be even and half would be odd, so that |A,| = n!/2. 
If so, you’re right. In Exercise 8.3.10, you showed that all cosets of a subgroup 
have the same cardinality. So if you can show that A, has precisely two cosets, 
itself and its complement, it will follow that |A,| = n!/2. 


EXERCISE 8.4.9 Show that |A,| = n\/2 by showing that A, has precisely two 
cosets. !3 


8.4.4 The Dihedral Group 


Now let’s look at another important subgroup, this time for the particular group 
S4. Consider a rigid square with the numbers 1, 2,3, 4 etched on its corners, sitting 
in the xy-plane where the positions of each corner are also written in the plane. 
(See Fig. 8.1.) Consider the following two moves for the square. 


1. A rotation 90° counterclockwise, a move we will call p (the Greek letter rho). 
2. A flip upside down, sending the top corner to the bottom, and vice versa, and 
leaving the side corners fixed. Call this move ¢ (the Greek letter phi). 


The move p can be expressed as an element of Sy: p = (1234). Visualize po 
and the cycle notation expression of it as sending the corner that initially occupies 
position 1 in the xy-plane to position 2, the corner initially in position 2 to position 3, 
and so on. Similarly, ¢ = (24), which sends the corner initially in position 2 to 
position 4, and vice versa. If we consider all possible ultimate positions of the 


13 When do f; and f generate the same coset of Ay? See the comments after Definition 8.3.5. 
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square that can result from combining moves p and ¢ in any way by composition, 
we have created a way of visualizing the subgroup of S4 generated by {p, ¢}. This 
subgroup of Sq is called the dihedral group and is denoted Dg. 

First, let’s note how many elements there are in Dg. We ask how many possible 
ultimate positions are there for the square that can result from a combination 
of rotations and flips. Perhaps it is clear that the square can end up either top 
side up or top side down, and in any one of four states of rotation. Thus eight 
distinguishable ultimate positions are possible, and o(Dg) = 8. 

Let’s create a Cayley table for Dg in the following way. Each element of Dg, that 
is, each ultimate position for the square, can be obtained by doing any necessary 
rotation first, then flipping afterward, if necessary. Thus every element of Dg can be 
written in the form ¢” p”. As an example, ¢p* would rotate 270° counterclockwise 
and then flip. This maneuver could be written in cycle notation as (24)(1234)3. 


EXERCISE 8.4.10 Simplify ¢p° into cycle notation with disjoint cycles. 


Since p has order 4 and ¢ has order 2, every element of Dg can be written as 
go” p", where 0 < m <1 and 0 <n <3. Writing elements of Dg in this way and 
denoting the identity by i, we have 


Dg = {i, p, 0°. p°. b. 0, 60°. $0} (8.63) 


To fill in the Cayley table values, we must combine all elements of Dg and 
write the compositions in a form from Eq. (8.63). This takes a little bit of work 
but yields some useful principles as you work through it. For example, what is 
(gp) o (f) = dog? The first maneuver is a flip, followed by a 90° rotation and flip. 


EXERCISE 8.4.11 Which element of Dg from (8.63) is equivalent to go? 


Table 8.64 contains a few of the entries for the Cayley table of Dg. In the next 
exercise, you will finish out this table and be pointed to a systematic approach that 
can save you a lot of time. Remember! In a Cayley table, the entry down the left 
column is written on the left, and the entry across the top row is written on the 
right. Since composition is read from right to left, it means that the entry from the 
top row is actually performed first! 


o| i p fp Pp & gp 0 0° 

i i p @ pp & gp gp ¢° 

P| pple a ge oe 

cae a ee (8.64) 
eG |) ee Be 

cw) g 
gp | op 
op | 07 
¢0° | ¢0° 
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EXERCISE 8.4.12 Complete Table 8.64.!4 


Normal Subgroups 


Let’s return to quotient groups and recall from Section 8.3 how our assumption 
that G is abelian got us over the hump of showing that the operation on G/H from 
Definition 8.3.5 is well defined. For notational simplicity, we write 


Ha * Hb = H(ab) (8.65) 


using juxtaposition for the binary operation on G and in the coset notation. To 
show « is well defined on G/H, we would suppose 


Ha, = Hay and Hb, = Hb (8.66) 
and try to show from this that 
H(a,b1) = H(anb2) (8.67) 


Since Eq. (8.67) is a set equality, one way to approach our task is to chase an 
element from one side to the other and back. A more efficient approach, however, 
is to exploit Exercise 8.3.3 and the fact that equivalence classes partition a set. 
In other words, to show that Eqs. (8.66) imply Eq. (8.67), it suffices to show that 
if a, =H a2 and bj =y bz, then ajb1 =H azb2. Let’s try to do that now, and see 
where we get stuck in the absence of G being abelian. 

Suppose a, =v a2 and by = bp. Then aia", bib;! € H. We need to show 
that (a,b1)(ayb2)—! € H, which is equivalent to a,b, bz lay Te, Perhaps the con- 
venience of commutativity is evident at this point because the only assumptions 
we have involve aja, ' and bib, | We must proceed with extreme caution, so we 
should spend some time thinking out loud. 

First, we know that there exists hj ¢ H such that b\b, ! = hy. Thus our task 
now is to show that ajh,a, ' € H. At this point we’re stuck because the only way 
to apply our assumptions to an expression involving a; and a, ! is if they can be 
associated. In our case, 1 is in the way. One possible assumption that would get 
us past this problem would be to assume that h; commutes with a; | which would 
require us to assume that every element of H commutes with every element of G. 
This is a pretty strong assumption about H, and we can do better. 

The standard assumption about H, weak as possible but still strong enough 
to give us a well-defined operation on G/H, is to assume a way to reorder terms 


First convert all expressions of the form p'¢ to their equivalents in the form ¢p/. Use these results to 
take expressions of the form ¢° o'¢“ p” and reorder p’¢“ in the middle. Thus all ps will gravitate to the 
right and all gs to the left. 
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where we can replace hia; | with ay ‘hy for some h2 € H. If we have this, then 
ayhya;' = ajay ‘ho, and the fact that aja;" € H allows us to conclude that 


(a,b) (anb2)~! = aybyb; ‘a5! = ayhiay" = ajay ho e€H (8.68) 


So here is the demand on H that will get us over the hump of showing that the 
operation on G/H is well defined when G is not abelian. Since a; 1 in our analysis 
above could be any element of the group, and h; could be any element of H, we 
arrive at the following definition of a special sort of subgroup where we can be 
sure that the operation on G/H is well defined. 


Definition 8.5.1 Suppose H is a subgroup of a group G. If for all g € G and 
h € H, there exists h; € H such that hg = gh, then H is called anormal subgroup 
of G, and we write H<G. 


Definition 8.5.1 only allows you to swap hg for gh1. It does not explicitly allow 
you to swap gh with some /1g. But you can show that it does. 


EXERCISE 8.5.2 Suppose H is a normal subgroup of G. Then for all g € G and 
h € H, there exists h, € H such that gh = hig. 


All our work here shows that the operation defined in Eq. (8.65) is well defined 
if H is anormal subgroup of G. Furthermore, your other work from Exercise 8.3.4 
where you did not exploit the abelian nature of G completes the proof that G/H 
is a group, and we arrive at the following. 


Theorem 8.5.3 Suppose G is a group and H is a normal subgroup of G. Then 
G/H with binary operation * defined by (Ha) « (Hb) = H(ab) is a group with 
identity He and inverses (Ha)~! = HaW!. 


There are ways other than Definition 8.5.1 to define normal subgroup, and 
different authors approach the idea in different ways. As we will see in Section 8.6, 
normality can be naturally defined in terms of mappings between groups. But we 
have chosen our definition, so any equivalent forms will have to be demonstrated 
as theorems. If we reread the analysis that led us to Definition 8.5.1, another way 
to state what we needed might jump out at us. Rather than state that there exists 
hz € H such that a, ‘hy = ha, a we simply could have insisted that there exists 
ha € H such that hz = anhyay", or simply that anhya;" € H. This has a natural 
appeal because testing a subgroup for normality then reduces simply to making 
sure that the criterion in the following theorem is satisfied. 


15 Apply Definition 8.5.1 to h7!g7!. 
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Theorem 8.5.4 Suppose H is a subgroup of G. Then H is normal if and only if 
for allh € HandgeG,g 'hge H. 


It is one thing to say that a subgroup is closed under the operations of the 
group. But the fact that g~'hg € H for allh € H and g € G adds a new dimension 
to the closure of H by saying, in a sense, that elements of H cannot be kicked 
outside of H by boxing them inside things of the form g~'Og. For a givenh € H, 
an expression of the form g~'hg is called a conjugate of h. Theorem 8.5.4 says that 
H is normal if and only if the conjugates of all its elements lie in H. Thus choosing 
any h € H and g € G, it might be that g~'hg is different from h, but at least it’s 
still in H. This is the way you will want to argue the following. 


EXERCISE 8.5.5 The alternating group A, is a normal subgroup of Sp. 


If G is not abelian and a subgroup Z is not normal, then a conjugate of an 
element of H might or might not be in H. For example, using p = (1234) € Dg and 
(12) € Sa, 


(12)~! (1234) (12) = (12)(1234) (12) = (1342) ¢ Dg (8.69) 


which by Theorem 8.5.4 proves that Dg is not normal as a subgroup of Sy. On the 
other hand, using (123) = (13)(12) € Ag and (14) € Sa, 


(14)! (123)(14) = (14)(123)(14) = (234) = (24)(23) € Ay (8.70) 
which is a consequence of the fact that Aq is a normal subgroup of S4. 


EXERCISE 8.5.6 Use Theorem 8.5.4 to show that H = {(1), (123), (132)} is a 
normal subgroup of $3. 


EXERCISE 8.5.7 IsH = {(1), (12)} anormal subgroup of $3? Prove or disprove. 


EXERCISE 8.5.8 From Exercise 8.4.6, H = {(1), (12), (34), (12)(34)} is a sub- 
group of S4. Prove or disprove that H is normal in Sq. 


If G is abelian, conjugation is not particularly interesting. 


Theorem 8.5.9 A group Gis abelian if and only ifevery h € G has no conjugates 
other than itself. 

The proof of Theorem 8.5.9 should be immediately clear, for the conditions 
that gh = hg for all g,h € Gand g-'hg = h for all g,h € G are identical. 

If we think of g as fixed and allow / to take on all values in H, we create what 
we call a conjugate of the subgroup H. Notationally, we write this as 


g 'Hg ={g'hg:he H} (8.71) 
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One interesting characteristic of the conjugate of a subgroup is that it is also a 
subgroup of G. 


EXERCISE 8.5.10 Let Hbea subgroup of a group G and fix g € G. Then g~! Hg 
is a subgroup of G. 


Given two subgroups H; and Ap, to say that H> is a conjugate of H; therefore 
means that there exists some g € G such that Hp = g-! Aig. 


EXERCISE 8.5.11 Let G be a group, and let H; and H> be subgroups of G. 
Define Hy = Hj, provided A) is a conjugate of H;. Show that = is an equivalence 
relation on the family of all subgroups of G. 


EXERCISE 8.5.12 For H = {(1), (12), (34), 2)(4)}, which is a subgroup of S4, 
determine the conjugate subgroup (13)~! H(13). 


For another example of a conjugate subgroup, consider Dg as a subgroup of 
Sq and let g = (12). For a given 6 € Dg, the expression g |g can be thought of in 
the following way. Since g acts first, it switches the numbers in positions 1 and 2 
on the square (an illegal move in Dg). Then 6 does a rotation and/or flip on this 
newly labeled square. Finally, g~! switches again the numbers in positions 1 and 
2 on the square. For example, 


ge! pg = (12)(1234)(12) = (1342) ¢ Ds (8.72) 


Transforming all the elements of Dg in this way creates another subgroup of S4 that 
you might need to play around with to understand. It turns out that g~!Dgg as an 
algebraic structure is just like Dg, except that the rigid square we described before 
will not work as a way to visualize it. Instead, picture the numbers {1, 2, 3, 4} being 
pushed from corner to corner by g~!pg and g~!@g according to Figure 8.2. The 


go 'pg go '¢9 


Figure 8.2 Effects of gpg and g—!$g on the square. 
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point to be made is that g~! Dgg is different from Dg, though it is a subgroup of 
S4. They both contain the identity, but except for that overlap, they cut through S4 
in completely different directions. 

If H is a normal subgroup of G, the situation is a little different concerning 
conjugates of H. The proof of the following should be quick. It says H is normal 
in G if and only if H has no conjugates other than itself. 


EXERCISE 8.5.13 Suppose H is a subgroup of G. Then H is normal if and only 
if g-'Hg = H forall geG. 


In showing that the binary operation on a quotient group is well defined, 
another feature of H that would get us over the hump from /2a2 to az2h3 involves 
linking the right coset Hap to the left coset az H. That is, if we had been given that 
Hay = aH, our problem would have been solved in precisely the same way. So 
here is another equivalence of normality. 


EXERCISE 8.5.14 Suppose H is a subgroup of G. Then H is normal if and only 
if Hg = gH forallg eG. 


To sum up, notice how our retreat from the requirement that G be abelian 
motivated the definition of a characteristic of subgroups that allows us still to 
construct the quotient group. And note the following. 


EXERCISE 8.5.15 Every subgroup of an abelian group is normal. 


Since our definition of normal subgroup was inspired by a retreat from the 
global condition of G being abelian, let’s compare and contrast normality in all its 
forms to similarly worded statements about abelian groups. 


If G is abelian and H < G: If H<G: 
1. Forallg,h eG, gh=hg. For all g¢€ G and he H, there exists 
hy € H such that hg = gh. 
2. Forallg,neG,g'hg=h.  Forallge Gandhe Hg 'thge H. 
3. Forallg¢G,g-'Hg=H. For all g € G, g-'Hg = H. 
4. Forallg ¢ G, Hg = gH. For all g € G, Hg = gH. 


EXERCISE 8.5.16 In Exercise 8.2.8, you showed that the center of a group G is 
a subgroup of G. Prove that the center of G is normal in G. 


EXERCISE 8.5.17 In Exercise 8.2.9, you showed that the intersection across a 
family of subgroups of G is itself a subgroup of G. Show that the intersection 
across a family of normal subgroups of G is normal in G. 


As a final observation, consider four sets H C H; C G C G,, where A isa 
normal subgroup of G. Normality of H as a subgroup of G says something about 
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the way elements of H behave in the presence of elements of G, and not simply 
how they behave among themselves. So, if H; is also a group, then the fact that 
H is normal in G will imply H is normal in H, as well. For if g~'hg € H for all 
g € G, then certainly the same is true for all g € H,. Now if G,; is a group, we 
know that H is a subgroup of G;. However, it might be that H is not normal in Gj. 
Even though g~!hg € H for all g € G, there might be some g € G; — G for which 


gthg ¢ H. 


Example 8.5.18 Beginning with S4, create the following subgroups. First, let 
H = {(1), (12)}. Since (12) is its own inverse, H is a subgroup of Sy. Let G = 
{(1), 12), (34), 12)(4)}. By Exercise 8.4.6, G is a subgroup of S4. Furthermore, 
G is abelian. By Exercise 8.5.15, H is anormal subgroup of G because G is abelian. 
However, by the next exercise, H isnot normalin Sy. 


EXERCISE 8.5.19 Show that H = {(1), (12)} is not normal in S4 by finding a 
conjugate of (12) that is not in H. 


Group Morphisms 


Now that we have a basic understanding of groups and some of their internal 
structure, let’s turn our attention outward to a special type of function from one 
group to another. The special feature we want these functions to have is that they 
preserve the binary operation. 


Definition 8.6.1 Suppose (G, x, eg, !) and (H,-,ey,~!) are groups, and sup- 
pose ¢: G > His a function with the property that d(x * y) = @(x)-¢(y) for all 
x,y € G. Then ¢ is called a homomorphism, or simply a morphism from G to H. 
If ¢ is one-to-one, it is called a monomorphism. If ¢ is onto, it is called an epimor- 
phism. If ¢ is both one-to-one and onto, it is called an isomorphism, and we write 
G = H, which is read “G is isomorphic to H.” If ¢ : G > G is an isomorphism, 
¢ is called an automorphism. 


EXERCISE 8.6.2 Let Zbe the group of integers under addition and E the group 
of all even integers under addition. Assuming that each of the following is a 
function, prove the following. 


(a) Show that ¢1 : Z > E defined by ¢1(n) = 2n is an isomorphism. 


(b) Show that ¢2 : Z > E defined by ¢2(n) = 4n is a monomorphism that is not 
an isomorphism. 


(c) Show that ¢3 : Z > Z defined by ¢3(n) = —n is an automorphism. 
(d) Show that ¢4 : Z > Z defined by ¢4(n) = 2n + 2 is not a morphism. 
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EXERCISE 8.6.3 Let Z be the group of integers under addition, let n be a 
positive integer (n > 2). Show that @: Z — Z, defined by $(k) = (n) +k is an 
epimorphism that is not an isomorphism. 


Example 8.6.4 Let G = (Rt, x, 1,~!) and H = (R, +, 0, —). Even though we 
have not discussed logarithms in this text, your work in pre-calculus reveals that 
#(x) = Inx is an isomorphism. For In x is a bijection from Rt to R and satisfies 
o(xy) = In(@xy) =Inx+Iny=¢()+¢()). & 


Example 8.6.5 Let G and H be groups, and define ¢ : G > H by $(x) = ey for 
all x € G. Then ¢ is called the trivial morphism. Mi 


EXERCISE 8.6.6 Define addition in R2 by 
(x1, v1) + (2, y2) = 1 + X2, 91 + 2) 


which is clearly a group. Let ¢ : R* > R be defined by @[(x, y)] = x. Show that 
¢ is an epimorphism that is not an isomorphism from the additive group R? to 
the additive group of real numbers. (This is called a projection morphism because 
every point in the xy-plane is mapped directly up or down to its x-coordinate.) 


EXERCISE 8.6.7 Let G be a group, and define ¢: G > G by ¢(g)=g!. 
Assuming ¢ is a bijection from G to itself, show that ¢ is an automorphism of G if 
and only if G is abelian. 

The morphic behavior of ¢ : G > H does not explicitly require that particular 
elements of G must map to particular elements of H. However, there are some 
restrictions of this sort inherent in the definition. 


Theorem 8.6.8 If¢: G— A isa group morphism, then: 


1. o(eg) = eH. 
2. Forallx € G, d(x!) = [@(@]71. 


We will prove part (1) and leave part 2 to you as an exercise. 


Proof of Part (1). 


en b(eG) = o(eG) = (eG * eG) = b(eG) : b(eG) (8.73) 


By cancellation, ¢(eg) = en. 


EXERCISE 8.6.9 Prove part (2) of Theorem 8.6.8. 
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By using part (1) of Theorem 8.6.8 as the root of an induction argument for 
n > 0, then using part (2) to take care of n < 0, you can show the following. 


EXERCISE 8.6.10 If¢:G—> Hisa group morphism and g € G, then for all 
integers n, 


6(g") = [¢(g)]" (8.74) 


EXERCISE 8.6.11 Restate Theorem 8.6.8 and Exercise 8.6.10 in their additive 
forms. 


The statement G = H in Definition 8.6.1 looks like a form of equivalence. It is a 
statement about the existence of a bijection from G to H, with the additional prop- 
erty that it preserves the binary operation. By Exercise 4.4.9, most of the work in 
proving that = has properties E1—-E3 has been done and will therefore require only 
references to applicable theorems. However, in each property E1—-E3, something 
will have to be shown about the morphic behavior of the functions involved. 


EXERCISE 8.6.12 The relation & in Definition 8.6.1 is an equivalence relation 
on the set of all groups. 


The word isomorphism has a connotation to it that deserves pointing out. For 
G to be isomorphic to H means that G and H are essentially the same group, in 
the sense that all the elements from one can be swapped one for one with those 
in the other and the internal relationships between them as expressed by x are 
retained by -. By giving a new name to every x € G, namely, ¢(x), and swapping 
the binary operation symbol in G for the symbol in H, we have effectively dressed 
(G, *, eg, ) in the clothing of (4, -, en, |). So to be able to map every element 
of G to a unique and distinct element of H, exhausting all elements of H and 
preserving the binary operation, is to show that, as far as their structure as groups 
is concerned, they are identical. Here are some illustrations of this principle. 


EXERCISE 8.6.13 Suppose ¢: G > H isa group isomorphism. Then, 


(a) If G is abelian, then A is abelian. 
(b) If G is cyclic, then H is cyclic. 


Instead of writing Rng(@), we will usually write 6(G). Subgroups of the domain 
and codomain are related in the following. 


EXERCISE 8.6.14 Suppose ¢: G > H isa group morphism. Then 


(a) $(G) is a subgroup of H. 
(b) If N is a normal subgroup of G, then (J) is a normal subgroup of $(G). 
(c) If N isa normal subgroup of H, then #~!(N) is a normal subgroup of G. 
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Notice that part (b) of Exercise 8.6.14 does not say that @(N) is a normal 
subgroup of H. If ¢ is not onto, it is possible that #(N) is normal in ¢(G) but not 
normal in H. 

Theorem 8.6.8 ensures that eg always maps to ey under a group morphism. 
It might be that other elements of G also map to the identity in H. In Exercise 8.6.3, 
every multiple of n maps to the identity coset (n) + 0. In Example 8.6.5, every 
element of G maps to the identity in H. We give a name to the set of all elements 
in G that map to the identity by a morphism. 


Definition 8.6.15 Suppose ¢: G — H is a group morphism. Then the pre- 
image of ey is called the kernel of @ and is denoted Ker(¢). If Ker(@) contains 
only eg, we say that the kernel of @ is trivial. 


EXERCISE 8.6.16 If¢:G— Hisa group morphism, then Ker(¢) is a normal 
subgroup of G. 


An interesting property of group morphisms is that you can sometimes learn 
a lot about the behavior of ¢ across all of G by looking at its behavior at certain 
places in G. Clearly, if ¢ is one-to-one, then Ker(¢) is trivial, because then only eg 
will map to ey. But, interestingly, the converse of this is also true. 


EXERCISE 8.6.17 Suppose ¢ : G > H isa group morphism. If Ker(¢) is trivial, 
then ¢ is one-to-one. 


So if 1 (ez) has only one element, then ¢ !(y) has only one element for every 
y € ¢(G). We can go even further to show that 


oO] = [Ker@)| (8.75) 


for all y € $(G), so that all pre-image sets of individual elements in the range of 
¢ have the same cardinality. Probably the easiest way to do this is to exploit the 
fact that the kernel is a normal subgroup of G in order to prove something even 
stronger than Eq. (8.75). The next exercise says that the pre-image of each element 
in the range of ¢ is simply one of the cosets of the kernel. By Exercise 8.3.10, all 
cosets of a subgroup have the same cardinality, so that Eq. (8.75) follows. In the 
next exercise, we use left cosets to keep the notation uncluttered. 


EXERCISE 8.6.18 If ¢:G—> Hisa group morphism, then for all y € ¢(G), 
there exists a € G such that ¢ '(y) = aKer(@). 


If you’ve caught on to what’s happening here, you might have observed that 
any group morphism ¢: G > H makes some interesting statements about the 
internal structure of G. A morphism ¢ gives rise to a normal subgroup of G, 
namely, Ker(¢). From there all our theory from Section 8.5 applies to present us 
with a quotient group G/ Ker(@), with its binary operation a Ker(¢) « b Ker(¢) = 
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(ab) Ker(@), identity Ker(@) and inverses [a Ker(¢)|~! = a“! Ker(@). So any time 
you are given a group G and can manage to find a morphism into some other 
group, then you have managed also to find a normal subgroup of G and motivate 
a quotient group from it. 

Also, if youre like a lot of people at your stage of the mathematical game, you 
wish there were a better way to see what’s going on inside this new quotient group 
G/ Ker(¢) than by visualizing cosets whacking each other around. Well, there is 
another way to visualize G/ Ker(@), for it is essentially the same as (isomorphic 
to) ¢(G). We will present a full-blown statement and proof of that soon. 

A group morphism ¢: G > H gives rise to a normal subgroup of G and a 
quotient group from it. Now let’s go the other way. Given a group G and any 
normal subgroup N, we can create a group H and an epimorphism ¢: G > H 
such that Ker(¢) = N. Showing this does not involve much we have not seen 
before, mostly just some observations. You will provide the only missing detail as 
an exercise. 


Theorem 8.6.19 If (G,*,e,~!) is a group and N is a normal subgroup of G, 
then ¢: G > G/N defined by $(x) = Nx is an epimorphism whose kernel is N. 


Proof. From our previous work, ¢ is defined on all of G and is well defined. 
Also, ¢ is onto because every element of G/N is a coset of N, generated by some 
x €G, and $(x) = Nx. We must show that ¢ behaves morphically and satisfies 
Ker(¢) = N. Pick x, y € G. Since the binary operation on G/N is well defined, 


pay) = N * (xy) = Nx * Ny = o(@)o(y) (8.76) 


Finally, by Exercise 8.6.20, Ker(¢) = N. 


EXERCISE 8.6.20 Finish the proof of Theorem 8.6.19 by showing that 
Ker(@) = N. 


So we can have it both ways. Two groups and a morphism give rise to a normal 
subgroup of the domain. And a group with a normal subgroup gives rise to another 
group and a morphism onto it. Here is the final tie. Whichever you start with and 
use to create the other, the range of the morphism and the quotient group of G 
are isomorphic. That is, ¢(G) and G/ Ker(@) are essentially the same group, as 
the following theorem states. For notational simplicity, we assume that the given 
morphism is onto so that we don’t have to distinguish between H and $(G). 


Theorem 8.6.21 Suppose ¢: G > H isa group epimorphism. Then 
G/ Ker(¢) = H (8.77) 


We will supply a skeleton of the proof here, though most of the details are left 
to you in the exercise that follows. As we do so, look at Figure. 8.3, which illustrates 
all the groups and mappings involved. First, there are the given groups G and H 


Figure 8.3 
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Ker d 


X maps to 
x Kerd 


Isomorphism between G/ Ker(¢) and H. 


with the epimorphism ¢ : G > H. Now create a carbon copy of G, only collect 
all the elements of Ker(@) and hogtie them together into a single entity in the 
sketch of G/ Ker(@). Similarly, go to each coset of Ker(¢), take all its elements, 
and lump them together into a single entity in G/ Ker(@) to create a visualization 
of G/ Ker(@). We have a link between G and G/ Ker(@), and that is the function 
that maps x € G to x Ker(¢). It might be that @ is one-to-one, or maybe it is not. 
But the extent to which ¢ collapses elements of G down to single elements of H is 
precisely the extent to which elements of G clump together into cosets in G/ Ker(¢) 
(Exercise 8.6.18), which itself depends on the size of the kernel. The task is to show 
that G/ Ker(@) and H are isomorphic by finding the required mapping between 
them. We must map each coset in G/ Ker(¢) to an element of H, and the way to 
do this is to choose a coset x Ker(@), grab some element of it, say x, and send the 
whole coset to ¢(x) in H. In the following proof, we use - and *x to represent the 
binary operations in H and G/ Ker(@), respectively. 


Proof. We must find a bijection w : G/ Ker(@) > H such that 
Wlx Ker(¢) xx y Ker(¢)] = wlx Ker()] - Wly Ker(@)] (8.78) 
for all x Ker(@), y Ker(¢) € G/ Ker(@). So define w : G/ Ker(¢) — H by 


wix Ker()] = o(a) (8.79) 
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By Exercise 8.6.22, y is well defined on all of G/ Ker(@), is one-to-one and onto, 
and satisfies Eq. (8.78). 


EXERCISE 8.6.22 Finish the proof of Theorem 8.6.21 by showing that y: 
G/ Ker(¢) > H defined by Eq. (8.79) satisfies the following. 

(a) y is defined on all of G/ Ker(@) (F1). 

(b) wis well defined (F2). 

(c) wis one-to-one. 

(d) wis onto. 

(e) For all x Ker(@), y Ker(¢) € G/ Ker(@), 


Wx Ker() * y Ker()] = w[x Ker(¢)]-wLy Ker()] 


As a final note, the theorems from this section provide us with another logical 
equivalence to normality of a subgroup. If N is a normal subgroup of G, then 
there exists a group H and an epimorphism ¢: G — H such that Ker(@) = N. 
Also, if ¢: G > H isan epimorphism, Ker(@) is anormal subgroup of G. So given 
a group G and a subgroup N, N is normal in G if and only if there exists some 
group H and some morphism ¢ : G > H such that N = Ker(@). 


EXERCISE 8.6.23 Construct notation and Cayley tables to determine (up to 
isomorphism) all groups of five or fewer elements. 


EXERCISE 8.6.24 Find isomorphisms between the four-element groups you 
found in Exercise 8.6.23 and the following. 


(a) The multiplicative group S = {+1 + i} 
(b) The subgroup of S4 in Exercise 8.4.6 


EXERCISE 8.6.25 Describe all cyclic groups. 


EXERCISE 8.6.26 Both the dihedral group (Section 8.4) and the quaternion 
group (Example 8.1.18) have eight elements. Show that they are not isomorphic. 
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Rings 


We can create algebraic structures of greater complexity than a group by endowing 
a set with two binary operations and laying down some assumptions about how 
these operations behave, both on their own and in relation to each other. In this 
chapter we look at several such structures. Before we do, some explanation is in 
order about how we will proceed, for the theory of rings involves so many details 
that a road map will be very helpful. 

First, in Section 9.1, we will define the most general algebraic structure with 
two binary operations, a ring, and construct several important examples. At the 
same time, we will make a passing reference to fields, the most specialized kind 
of ring we will study. In Section 9.2, we define subring and construct a number of 
examples. In Section 9.3, we will look at several properties that the most general 
rings share. One particularly important class of rings can be created by adjoin- 
ing an element to a given ring; we devote Section 9.4 to this class of examples. 
In Section 9.5, we dive down inside a ring to look at specialized substructures 
of a general ring. Ideals, principal ideals, prime ideals, and maximal ideals are 
special types of substructures we will see there. In Sections 9.8-9.11, we will 
study four increasingly specialized kinds of rings: integral domains, unique fac- 
torization domains, principal ideal domains, and Euclidean domains. Each class 
of these structures is a proper subset of the class that comes before it, so as 
we progress, we will demonstrate (or at least refer to) examples that illustrate 
this. For example, we will see a ring that is not an integral domain, an integral 
domain that is not a unique factorization domain, and so on. In Section 9.14 we 
will look at ring morphisms, and finally, in Section 9.15, we will build quotient 
rings. 


Rings and Fields 


Rings Defined 


The simplest structure with two binary operations—and therefore the point 
where we begin—is called a ring. Because the assumptions we make about ring 
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operations so closely resemble those for addition and multiplication on the inte- 
gers, it is common to use the notations + and either - or juxtaposition for the 
two operations, and we will shamelessly call them addition and multiplication, 
respectively, even though by doing so we run some slight risks. One risk is that 
we might inadvertently think that some of the rings we create are more like 
the integers with their addition and multiplication than the assumptions justify, 
because the WOP makes the integers a very special kind of ring. So we have to 
be careful not to bring any excess baggage from our understanding of the inte- 
gers that the general ring assumptions do not imply. On the other hand, being 
able to envision the integers as a sort of quintessential example of a ring means 
that some of the results we proved for the integers will translate directly over to 
any ring and be clear to us right away. So some of the theorems we will state 
in this section will require very little in the way of a new proof but will mimic 
those for the integers with very little or no variation. The second risk we run 
in using notation already associated with the integers is that it might stifle our 
imagination when we try to create new and interesting rings. Creative minds 
have concocted quite a number of very interesting rings from interesting sets 
and definitions of equality, addition, and multiplication. We will see several of 
them. So here is the definition of ring, along with an enumeration of its defining 
characteristics. 


Definition 9.1.1 Suppose Ris a non-empty set endowed with binary operations 
+ (addition) and - (multiplication), such that R is an abelian group under addition, 
with additive identity 0 and additive inverse operation —, and multiplication is 
an associative binary operation that distributes over addition from the left and 
right. The algebraic structure (R,+, 0, —, -) is called a ring. If multiplication is 
a commutative operation, then R is called a commutative ring. If there exists a 
nonzero identity element for multiplication, we denote such an element e, and it 
is called a unity element or simply a unity. 


Let’s spell out the essential features of a ring in glorious detail. 


(R1) Addition is well defined (G1). 

(R2) Addition is closed (G2). 

(R3) Addition is associative (G3). 

(R4) There exists 0 € R such that a+ 0 =a for alla € R (G4). 

(R5) For alla € R, there exists —a € R such that a + (—a) = 0 (G5). 
(R6) Addition is commutative (abelian). 

(R7) Multiplication is well defined (G1). 

(R8) Multiplication is closed (G2). 
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(R9) Multiplication is associative (G3). 
(R10) For alla, b,c € R,a(b+c) = ab+ac and (b+ c)a = ba+ ca. 


Definition 9.1.1 requires that, if a ring has a unity element, it must be nonzero. 
Otherwise, the distributive property causes the entire ring to collapse to {0}, just 
as it does in the real numbers if 1 = 0. Since the trivial ring {0} is not particularly 
exciting in its internal structure, and since an occasional general result about rings 
with unity would not apply to the trivial ring, we simply insist for convenience that 
e is nonzero. 

Definition 9.1.1 does not mention the existence of multiplicative inverses. Cer- 
tainly, if a ring has a unity, some elements other than the unity might have a 
multiplicative inverse. We will address this issue in Section 9.3. 

We can only hope that a lot of our examples of groups and their binary oper- 
ations can serve as raw material from which to construct rings, or at least that 
new settings we will create can exploit binary operations whose basic features 
are transparent enough for us to make our way through verification of properties 
R1-R10 quick and relatively painless. Here are a few examples of rings. We will 
return to all of them several times in later sections. 


Example 9.1.2 The integers with addition and multiplication form a commuta- 
tive ring with unity element 1. Similarly, the rational numbers, real numbers, and 
complex numbers under the same operations are commutative rings with unity. 
The even integers are a commutative ring without unity. 


EXERCISE 9.1.3. For n >2, consider Z, with addition @, defined as in 
Eq. (8.39). Define multiplication ®, in an analogous way: 


[(1) + a] @n [(1) + 6] = (2) + ab (9.1) 


Table 8.2 illustrates the behavior of multiplication in Zs. From all our work in 
Section 8.3 on Z, as an abelian additive group, properties R1—R6 are satisfied. 
Show the following for Zp. 


(a) Addition and multiplication satisfy properties R7—R10. 
(b) Multiplication is commutative. 


(c) There exists a multiplicative identity. 


As we will see later, the value of n has much to do with which elements of Z,, have 
a multiplicative inverse. 

It would be a shame not to introduce an example of a matrix ring at this point, 
for matrices are very important in both theoretical and applied mathematics. They 
are the bread and butter of linear algebra and of many highly computational 
processes that have become practical only since the advent of computers. For our 
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purposes, they are a storehouse of examples that exhibit all kinds of interesting 
behaviors. 


Example 9.1.4 <A matrix is a two-dimensional array, typically of real numbers: 


a1 42 *** Ain 
a2,1 42.2 *** a2n 

A=| |. ae (9.2) 
Gm1 Gmj2 °*** GAm,n 


The matrix in Eq. (9.2) is said to have dimensions m byn, and the set of allm x n 
matrices with real number entries is denoted R,,,..,. If a matrix A has dimensions 
m X n, we sometimes write it as Am xn if we need to display its dimensions. Notice 
how entries of A are tagged with a row and column number and in that order. The 
commas in the subscripts are annoying to write and probably not necessary, so we 
will omit them if we can get away with it and still be clear. Thus a4. means the 
entry down in row 4 and across in column 2. Most of the matrices we will use will 
be 2 x 2, because we do not need to be any more general than that to construct 
some interesting examples. Mi 


Before we even discuss binary operations on sets of matrices, we need to 
define what it means for two matrices to be equal. We say two matrices, Am, xnj 
and Bm xn), are equal if two conditions are satisfied. First, the dimensions of A 
must be the same as those of B; that is, mj = m2 and n1 =n. Second, all their 
corresponding entries must equal as real numbers: aj, = bj, for all j and k. Clearly, 
this definition is an equivalence relation, for the fact that properties E1-E3 are 
satisfied for the dimensions and all the real number entries in the matrices involved 
reveals that our definition of matrix equality also satisfies properties E1-E3. 

The addition of matrices requires that they have the same dimensions, and 
A + Bis merely the matrix whose entries are the sums of the corresponding entries 
from A and B. For the 2 x 2 case, writing A = e ee | and B= & el 

421 a22 
we have 


(9.3) 


ait+by a+ diz 
A+B= 
ie + by, a2 + | 


Matrix multiplication is more complicated than addition. In general, in order 
for the product of two matrices to be defined, they do not have to have the same 
dimensions, but some relationship between their dimensions must be satisfied. 
Since two square matrices (n x n) can always be multiplied to produce another 
n X n matrix, we will define multiplication only for this case. The 2 x 2 case should 
get the point across, though we will explain it in more general language. Writing 


a=|% “| and Bee A (9.4) 


9.1 Rings and Fields 291 


we define AB in the following way. To calculate entry (j,k) (row j, column k) in 
the product, mentally highlight row j of A and column k of B. Mentally run your 
fingers across row j of A and down column k of B, multiplying the pairs aj; bi,, 
a jbo, etc; and add up these products. This sum of product pairs is entry (j, k) in 
AB. So for A and B in (9.4), 


_ jae+bg af+bh 
Ao bas id (9.5) 


EXERCISE 9.1.5 Let A= le “| and B = b al Calculate AB and BA. 
By Exercise 9.1.5, matrix multiplication is not commutative. With these 
definitions of matrix addition and multiplication, consider 


Rox2 = (|: 4 :a,b,c,d € R| (9.6) 


We show R2x2 is anoncommutative ring with unity element. That matrix addition 
is well defined (R1) is transparent and notationally tedious. If A = B and C = D, 
then showing A + C = B+ Damounts to nothing more than applying the fact that 
real number addition is well defined to each entry in A+ C and B + D. Similar 
drudgery reveals that matrix multiplication is well defined, so property R7 holds. 
Closure of addition and multiplication in R2,2 is immediate from the definitions, 
so that properties R2 and R8 hold. Associativity and commutativity of addition 
in the real numbers mean that properties R3 and R6 hold. The matrix denoted 0, 
with all zero entries, is the additive identity, and 


a b —a —b 
Te a= on 
The only remaining properties are R9 and R10, associativity of multiplication 
and left and right distributivity. These are not immediately obvious, but verifying 
them with all the necessary notation is about as exciting as counting stripes on the 
highway, and you are about as likely to make a mistake along the way. 


So R2x2 is a noncommutative ring. Does it have a unity element? Why yes it 
does. Writing 


1 0 0 0 
0 1 0 0 

Pee 0 1 0 (9.8) 
000 .--. I 


the matrix with 1s down the main diagonal and Os elsewhere, you can see that 
lhygA = Aly2 = A for all A € Rox. Finally, by exactly the same reasoning we 
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have used here, Rix» is a noncommutative ring with unity element /,x,, as are 
Qnxn and Zn xn. 


Here is an example of a ring that we can create from two given rings R and S. 
Verifying that this creation is a ring is both tedious and transparent, but you should 
at least mentally walk through the steps (or at least the first few of them) to see that 
it satisfies properties R1-R10. Rather than going crazy with notation to distinguish 
operations and elements of R from those of S, writing expressions like Or and +s, 
we will assume your acquired level of mathematical sophistication makes them 
unnecessary. Just make sure you notice which set all the operations are being 
performed in. 


Example 9.1.6 Suppose R and S are rings, and consider 
RxS={(,s):reR,seS} (9.9) 


the Cartesian product of R and S. Define (1, 51) = (r2, s2) in R x Sif = r2 and 
S| = s2. Define addition and multiplication in R x S by 


(1, 51) + (72, 82) = (11 + 12,51 +52) and (rq, 51)- (72, 52) = (11r2, 8182) 
(9.10) 


Then R x S is aring under the operations defined in Eqs. (9.10). The zero element 
of R x S is (0,0), and —(7, s) = (—r, —s). If R and S each have a unity element, 
then so does R x S,and erxs = (er, es). ff 


EXERCISE 9.1.7 Let R be the set of all functions defined on the real numbers. 
We use Definition 4.1.9 as our definition of equality. For two functions f and g, 
define their sum f + g by the rule (f + g)(x) = f(x) + g(x), and let composition 
serve as multiplication. Show that R with these operations is a non-commutative 
near ring with unity, in that one item in the list R1I-R10 fails. 


EXERCISE 9.1.8 LetUbea non-empty set and P(U) be the family of all subsets 
of U. Using symmetric difference as addition and intersection as multiplication, 
show that P(U) is a commutative ring with unity. 


Fields Defined 


It might seem strange to introduce our next term at this point, but it turns out to 
be more convenient as we progress through the theory of rings. A field is a special 
kind of ring, and its defining characteristics make it the most specialized kind of 
ring we will study in this text. Although we won’t delve deeply into a general 
theory of fields, we will notice a few of their characteristics that are pretty easy to 
pick up along our way. 


Definition 9.1.9 Suppose K is acommutative ring with unity, with the property 
that every nonzero element has a multiplicative inverse. Then K is called a field. 
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In addition to properties R1—R10, a field K must have the following features. 


(K11) There exists a nonzero element e such that e- k = k for allk € K (G4). 
(K12) For all nonzero k € K, there exists k~! such that k - k~! = e (G5). 


(K13) Multiplication is commutative (abelian). 


Notice that properties K11-K13 complete the requirements for K* to be an 
abelian group under multiplication. Thus a shorthand way of defining a field K is 
to say that K is an abelian group under addition and that K~ is an abelian group 
under multiplication. 


Example 9.1.10 The rational numbers and the real numbers are fields. Also, 
the complex numbers are a field. In Section 8.1, we showed that C with addition is 
an abelian group, and in Exercise 8.1.20, you showed that C* with multiplication 
isan abelian group. M# 


Example 9.1.11 Rox2 is not a field because multiplication is not commut- 
ative. I 


EXERCISE 9.1.12 Not only is multiplication in R22 not commutative, but also 
many elements do not have a multiplicative inverse. Show that lo | has no 
inverse by showing that the equation 


eels ais 


has no solution E AF 
c ad 


Subrings 


Suppose R is a ring and S is a subset of R. If S is also a ring under the same 
operations, we say that S is a subring of R. Demonstrating S is a subring, some of 
the properties R1-R10 are inherited from R, while some (the closure properties) 
must be shown for S. 


(S1) Sis closed under addition (R2, H1). 

(S2) S contains the additive identity (R4, H2). 
(S3) Sis closed under additive inverses (R5, H3). 
(S4) Sis closed under multiplication (R8, H1). 
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A subring S of a ring R is merely a subgroup of the additive group that is also 
closed under multiplication. We call {0} the trivial subring, and all subrings other 
than {0} and R itself are called proper subrings. 


Example 9.2.1 The integers are a subring of the rationals, and the rationals are 
a subring of the realnumbers. 


If R has a unity, it is not necessary that the unity be in S in order for S to be a 
subring of R. 


Example 9.2.2 Let m be a positive integer, and write mZ = {km : k € Z}, the 
set of integer multiples of m. This is another common notation for the set in 
Eq. (8.31), where we denoted by (m) the subgroup of the additive group of integers 
generated by m. Then mZ is a subring of the integers, because it is a subgroup of 
the integers under addition, and it is closed under multiplication. If m > 2, then 
mZ does not containl. li 


EXERCISE 9.2.3 Find all subrings of Z¢ and Z7. 
Example 9.2.4 Z2x2 isasubring of Rox2. 


EXERCISE 9.2.5 Calla square matrix diagonal if its only nonzero entries lie 
on the main diagonal. Let D2,.2 be the subset of Z2,2 consisting of the diagonal 
matrices. Then D2 x2 is a subring of Z,2. 


Example 9.2.6 Inthis example, we create a subring of the rational numbers that 
we will return to in Section 9.9. Let Qgp be the subset of the rational numbers 
whose denominators are odd. There is more than one way to denote an element 
of Qgp. The form 


m 
—— :mneZ 9.12 
{= +1 Oe) 
is an obvious way. But another useful way to denote the set is to exploit the prime 
factorization of the numerator and denominator, isolating 2 to keep it separate 
from all the other primes involved. This works for all elements except zero, which 
we throw in separately. 


pn Sige 
pO PIP2 "Pr sn © Wand pj, qj are odd primes 
q192°° 4s (9.13) 


for alll <i<rand1<i<s| 


Qop = {0} U ( 


The amount of repetition among the p; and gq; does not matter. And notice that 
the form of an element in Eq. (9.13) includes 1 by letting n = 0, r= s = 1 and 
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pi=q =3. Verifying that Qop has properties S1-S4 is quick. Using either 
form (9.12) or (9.13), we see that both addition and multiplication are closed 
because the product of denominators of two elements is odd. That Qgp contains 
0 and additive inverses is obvious. Thus Qgp is a subring of the rationals. 


EXERCISE 9.2.7 Show that the set of all rational numbers with even numerator 
and odd denominator is a subring of Qgp. 


Example 9.2.8 Let R be the near ring in Example 9.1.7, and let S be the set 
of all polynomial functions. Although it is notationally tedious to verify, S 
satisfies properties S1-S4 with respect to R. For the sum of two polyno- 
mials is a polynomial, the zero function 0(x) =0 is a polynomial function 
whose coefficients are all zero, and the additive inverse of a polynomial 
is a polynomial. Furthermore, the composition of two polynomials is a 
polynomial. Note that S contains the unity element, which is the identity 
function. Mf 


EXERCISE 9.2.9 Let Ubea non-empty set, and let A be a subset of U. Show 
that P(A) is a subring of P(U) from Exercise 9.1.8. 


If you are required to show that a given set S is a subring of R, you might 
save yourself some time if you exploit the fact that S1-S3 are merely the subgroup 
properties H1—H3 applied to R and S as an abelian additive group and subgroup. 
If you have already shown that (S, +, 0, —) is a subgroup of (R,+, 0, —), then a 
lot of your work in showing S is a subring of R is already done. Keep that in mind 
as you prove the following. 


EXERCISE 9.2.10 Suppose F is a family of subrings of a ring R. Then Nse¢8 
is a subring of R. 


Although subrings are very interesting in their own right, they are bland when 
compared to the special kind of subring called an ideal. Ideals come in all kinds 


of interesting flavors, and we will taste several in Section 9.5. 


EXERCISE 9.2.11 Define ep to be a right unity for a ring Rifr-er =r for all 
r é€ R. Similarly, call e7 a left unity if ez -r =r for allr € R. Let 


= ile 0| :a,be ef (9.14) 


(a) Show L is a ring by showing it is a subring of R22. 


(b) Find an element of L that is a right unity but not a left unity. 
(c) Show that this right unity is not unique. 
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Ring Properties 


Since a ring is an abelian group under its addition operation, all properties 
of abelian groups you proved in Chapter 8 apply to addition. With regard to 
multiplication and its interaction with addition, it would probably be a good idea 
to swing back through Chapters 2 and 3 and point out theorems that we proved for 
the real numbers that exploited only their ring properties. A lot of theorems will 
then translate directly over to a general ring. The only difference is that multipli- 
cation might not be commutative, so we have to state and prove certain theorems 
in two-sided language to get the full strength. We will state the theorems here, with 
appropriate comments along the way. You will prove some of them as exercises. 
The corollaries should be mere observations. 


EXERCISE 9.3.1 If Risa ring, thena-0=0-a=Oforallae R. 


Exercise 9.3.1 implies that zero will not have a multiplicative inverse in a ring with 
unity. 


EXERCISE 9.3.2 IfRisa ring and a, b € R, then 


(a) (—a)b = —(ab) 
(b) a(—b) = —(ab) 
(c) (—a)(—b) = ab 


Corollary 9.3.3 If Risaring with unity e, then (—e)a = a(—e) = —aforallae R. 


In an abelian group with operation , (a * b)~! = a7! * b7!. Since a ring is an 
abelian group under addition, this translates to the following additive form. 


Theorem 9.3.4 If Risa ring, then —(a+ b) = (—a) + (—b) for alla, be R. 


The distributive property extends nicely in a general ring to yield the following 
result analogous to Exercise 3.4.15 and Theorem 3.4.16. 


Theorem 9.3.5 If Risa ring anda, bj, b2,...b, € R, then 
ay eat Ok = Ya (abe) (9.15) 
Theorem 9.3.6 If Risa ring and aj, a2,...,4m,b1, b2,...bn € R, then 
(a1 a4) (Soke be) = Es (ear jhe) (9.16) 
As in a group, if a ring has a unity element, it can have only one. Your proof 


from Exercise 8.1.23 would translate directly over to a general ring, pretty much 
word for word. 
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Theorem 9.3.7 If Ris a ring with unity, then the unity element is unique. 


Even though elements of a ring with unity are not assumed to have multi- 
plicative inverses, some of them might. If for a given x there exists y such that 
xy = yx =e, then x is called a unit of the ring. Notice that Exercise 9.3.1 implies 
that zero is not a unit. 


EXERCISE 9.3.8 Find, with verification, all units in the following rings. 
(a) Z 

(b) Z12 

(c) Z7 

(d) D2x2 from Exercise 9.2.5 

(e) The near ring of functions from Exercise 9.1.7 


(f) The power set ring from Exercise 9.1.8 


Divisibility in a ring is defined in pretty much the same way it is in the integers, 
except that we must distinguish between left and right divisors. 


Definition 9.3.9 Let R be a ring, and let a and b be elements of R, where a is 
nonzero. Then a is called a left divisor of b provided there exists nonzero k € R 
such that ak = b. Similarly, a is called a right divisor of b if there exists nonzero 
k € Rsuch that ka = b. If R is a commutative ring and there exists nonzero k € R 
such that ak = b, we say simply that a is a divisor of b, or that a divides b, and we 
write a | b. 


If a divides b, it does not necessarily mean that the k such that ak = borka=b 
is unique. In a more specialized ring we will study in Section 9.8, however, we will 
have uniqueness. 


EXERCISE 9.3.10 Find nonzero elements a, b, ky, kz in each of the following 
rings where ak, = b and ak2 = b, but ky is different from kp. 


(a) Zi2 
(b) Z2x2 


Having defined divisors, we can now define what it means for an element of 
a ring to be prime, though we will not really look into any of its properties until 
Section 9.8. To motivate the term by returning to the positive integers, a prime 
p has exactly two distinct positive integer divisors, 1 and p. Thus by definition, 
1 is not prime. So if p is a prime number and p = ab is any factorization of p 
into positive integers a and b, then either a = 1 or b = 1. In the ring of integers, 
primes are extended to include the negatives of the prime natural numbers. Thus 
in the integers, for p to be prime means that if p = ab is any factorization of p into 


298 


Chapter9 Rings 


integers a and J, then either a or b is +1. In Exercise 9.3.8(a), you showed that 
the integer units are +1. In a general ring, a prime element is defined by using the 
language of divisors and units. 


Definition 9.3.11 Suppose R is a ring, and pis a ring element that is not a unit. 
Then p is said to be prime provided every factorization p = ab into ring elements 
a and b implies either a or bis a unit. 


Definition 9.3.11 automatically excludes zero from being prime, for 0 = 0-0, and 
zero is not a unit. 


EXERCISE 9.3.12 Find all primes in Z4, Ze, and Z7. 


In Exercises 2.1.8 and 2.1.19, you proved multiplicative cancellation for 
nonzero real numbers and the principle of zero products. In a ring, these proper- 
ties do not necessarily apply, either as part of the definition or as logical conse- 
quences of it. However, these properties will reappear in Section 9.8 when we 
discuss integral domains. From Definition 9.3.9, if ab = 0 while neither a nor b is 
zero, then a and b are called divisors of zero or zero divisors. 


EXERCISE 9.3.13 Find a zero divisor in each of the following rings. 


(a) Z, for some strategically chosen n 


(b) Z2%2 


EXERCISE 9.3.14 Let Rand Sbe rings. Find all units and zero divisors in R x S, 
in terms of the units and zero divisors of R and S individually. 


EXERCISE 9.3.15 Find all zero divisors in the power set ring from 
Exercise 9.1.8. 


EXERCISE 9.3.16 Show that if a is a divisor of zero ina ring with unity, then a 
is not a unit. 


EXERCISE 9.3.17 If Risa ring with unity and x is a unit, then the element y 
satisfying xy = yx = e is unique. 


A lot of the results in Section 2.2 involved ordering of real numbers as mea- 
sured by <. Since a ring does not necessarily have any such way of comparing 
its elements, none of these results has meaning in a ring without such a basis for 
comparison being defined. Thus rings do not necessarily have positive and neg- 
ative elements; there is not necessarily a way to measure the size of elements 
as absolute value does in the real numbers, and there is not necessarily a way 
to make the WOP applicable to subsets. And just because the equation x7 = —1 
has no real solution, it does not mean that the equation x? = —e will not have a 
solution. 
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EXERCISE 9.3.18 Determine all solutions to x2 = —1 in Zs, Zo, Z7, and Zi. 


Since a ring with its addition operation is an abelian group, we can apply the 
additive forms of the equations from Exercise 8.2.16. 


0a = 0 (9.17) 

(n+ l)a=na+a (9.18) 
(—n)a = —(na) (9.19) 
ma+na=(m-+n)a (9.20) 
m(na) = (mn)a (9.21) 
n(a+b)=na+nb (9.22) 


Be careful to distinguish between integers and ring elements in these equations. 
Also, note that Eqs. (9.20)—(9.22) are not the associative and distributive properties 
that characterize either the ring or the integers individually. 


Example 9.3.19 In Zox2,let A= : l and B = E oh Then 


4050 ae, 
i). Oro 
04 =o] | =o Ol (9.23) 
6 times 
12 1 6 12 
sa=| i A +[ i sels | (9.24) 
-6A=-64)=—| _§ le: 5 (9.25) 
i's 12 4 8 24 6 12 
4a¢2a=al | +2 [1 ‘lb et ol+ [2 ‘eal 6 3 
=6| | g|=84=4+2)4 (9.26) 
42ay=4] 3 ee . =s[_j g|=84= 4-294 (9.27) 


warpna({t of ee a]=["s s]=[-4 o]+[’s §] 


! Calculate all values of x? and see if —1 is ever produced. 
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EXERCISE 9.3.20 Suppose R is a ring with unity element e. Show that 
(me) (ne) = (mn)e for all positive integers m and n. 


Even though multiplication in a ring is not necessarily accompanied by an 
identity and inverses for elements, we can use the multiplicative forms of the 
definitions of a” in a limited way. For a ring element a, we begin by defining a! = a 
and a"tl =a". aforn>1.If Rhasa unity element e, we also define a° = e, but 
only for nonzero a. If ais a unit of R, we can define a~” = (a-1y". 


1 


Example 9.3.21 In Zox2,let A= |_ Then 


-1 -2 
A= A*x A= i Z| ete. | 


EXERCISE 9.3.22 Evaluate the first few (multiplicative) powers of 3 and 5 
in Zio. 


With these definitions of a” for appropriate integers n, and by arguments 
exactly like those in Exercise 3.5.4, we have the following. 


Theorem 9.3.23 Suppose R is a ring, a,b € R, and let m and n be positive 
integers. Then 

a" sd =a"™ (9.29) 

(a™)" =a™ (9.30) 

If R has a unity element and a is a nonzero ring element, Eqs. (9.29) and (9.30) 


hold for all nonnegative integers m and n. If a is a unit, these equations hold for 
all integers m and n. Furthermore, if R is commutative, 


(ab)" = a"b” (9.31) 

for all n for which a” and b” are defined. 
One big difference in the way we visualize the integers and Z,, is that the 
integers extend out indefinitely along the number line in both directions, whereas 


Z, is circular. If we generate both of these rings by considering 1,1+1,1+1+1, 
and so on, no expression of the form nl = )~;_, 1 ever produces a sum of zero 


9.4 


9.4.1 
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in the integers, but n1 = }“;_, 1 = 0 in Z,. In a general ring with unity element e, 
whether or not some expression ne = )“7_; e = 0 ever occurs motivates a term. 


Definition 9.3.24 Suppose R is a ring with unity, and suppose there exists a 
positive integer n such that ne = 0. Then the smallest such n for which this holds 
is called the characteristic of R, and is denoted char R. If no such positive integer 
exists, then R is said to have characteristic zero. 


In Z,, the fact that n =, 0, or n1 is zero in Z, means that char Z, <n. On the 
other hand, if m is a positive integer and m =, 0, that is, if m1 is zero in Z,, then 
mis a multiple of k,so that m > n. Thus char Z, > n,so that we have proved the 
following theorem. 


Theorem 9.3.25 Ifn>2,thenchar Z, =n. 
EXERCISE 9.3.26 What is char(Z4 x Z1g)? Explain. 


If adding the unity element to itself n times produces a sum of zero, then the 
same is true for all elements of the ring. 


EXERCISE 9.3.27. Let Rbea ring with unity and nonzero characteristic n. Then 
nx =O for allx € R. 


Ring Extensions 


We can create a very important type of algebraic structure from a given alge- 
braic structure by tossing in a new element, stirring well, and letting the mixture 
expand into another algebraic structure of the same type. It is called the process of 
adjoining an element in order to create what is called an extension of the original 
structure. In this section we want to get acquainted with the creation of exten- 
sions by adjoining elements to commutative rings. In principle, only two types of 
ring extensions can result from adjoining an element. We will begin with a very 
specific example of the first type, but instead of building it up as an extension of 
a certain ring in the most rigorous way, we will just lay the whole structure out 
there, define equality and the operations, and show that what we have presented 
is a ring. But don’t worry. We will make up for our lax introduction of this ring 
in Section 9.14, where we will see a more rigorous way to construct it. After we 
have presented our example of the first type of ring extension, we will point out 
how other extensions of the same type can be created through precisely the same 
reasoning. Finally, we construct the canonical example of the second type. We will 
use these constructions over and over throughout the rest of this chapter. 


Adjoining Roots of Ring Elements 


Example 9.4.1. Let S = {a+ b/2+cV/4: a,b,c € Z}, the set of all integer lin- 
ear combinations of {1, /2, /4}. First, we define x = aj +b, /2+c1~V/4 to be 
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equal to y=a2+ bo J/2 + V4 provided a; = a2, bi = b2, and cj = c2. Notice 
that this definition is an equivalence relation because it is just an application of 
integer equality in triplicate.” Define addition @ and multiplication © in a natural 
way, based on the extended distributive property and the behavior of </2 in the 
real numbers: 


(a+ bV24+cV4) @ (d+eV2+ fV¥4) = (@4+ad+bt+ev24+ (ct fyv4 
(at bV2 + cV4) © (dteV2+4+ fv/4) = 


(ad + 2bf + 2ce) + (ae + bd + 2cf)V2 + (af + be +.cd)V4 
(9.32) 


Because @ and © in S have the familiar behavior we expect when viewed within 
the context of the real numbers, we could simply use + and -. Just remember that 
a single entity in S is of the form a + b¥/2 + c</4 and includes partial forms like 1, 
6/2, or 4 — /4 by letting certain coefficients be zero. 

From Egg. (9.32) and closure of integer addition and multiplication, the clos- 
ure of © and © is immediately obvious. Also, S contains the additive identity 
0+ 0/2 + 0,/4, additive inverses (—a) + (—b)/2 + (—c)/4, and unity element 
1+0/2 + 0,/4. Furthermore, all remaining ring properties are assumed for the 
real numbers and are therefore inherited by S. Since © is commutative (next 
exercise), S is a commutative ring with unity element. Mi 


EXERCISE 9.4.2 Show that © in Example 9.4.1 is commutative. 


The notation we use for the ring in Example 9.4.1 is ZL V2], which is meant to 
denote that the ring of integers has had an additional element 2 thrown in, or 
adjoined, as we say. Adjoining an element to an algebraic structure is obviously 
different from unioning it onto the set. Instead of merely tossing it in as one more 
additional element, we toss it in, then combine it by addition and multiplication 
with itself and all other elements to expand into a ring. Thus you can see that the 
presence of 4 in Z[,/2] is necessary so that we have closure of multiplication: 
(/2)? = +/4. But the fact that (./2)? is an integer means that expressions of the 
form a+ b\/2 + cV4 are all that are necessary. By similar reasoning to that in 


This definition of equality will raise all kinds of concerns in the mind of your professor because of 
something you will probably just assume without any basis. You probably think that our definition of 
equality here exactly coincides with equality in the real numbers, so that two expressions in S are equal 
if and only if they are equal as real numbers. Clearly, if x = y as we have defined equality for S, then 
x = yas real numbers. But just because ay + by vp) fal V4 = aa +b2 V2 4 C2 V/4as real numbers, we 
cannot conclude immediately that a, = aj, by = bp, and cy = cp. If you crank out 5,096,516,652 — 
5184/2 + 91,047,715,794,/4 and 2,669,624,714 + 130,936,500,093 /2 — 11,347,811,196/4 on a 
TI-85 calculator, it appears they might be equal in the real numbers. They are not, and it is indeed true 
that equality in R implies equality in S. That is, if there is a way to write a real number in the form 
at b</2 +44, then there is only one way to do it. To prove this, you need to know more about Vp) 
and 4/4 as real numbers and their relationship to each other in the context of the integers. The term 
is linear independence, and you will see it in linear algebra. 
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Example 9.4.1, we could begin with the integers (or the rationals), choose a positive 
integer n and an integer x, define equality, addition, and multiplication, and show 
all ring properties (plus commutativity) for 


Zl x/x] = {ap tayy/x+ a x2 + eb an_1V x"! a; € Z} (9.33) 


The form of elements of Z[4/x] in Eq. (9.33) assumes that n is the smallest positive 
integer such that x” is an integer, so that none of the terms 2/x,..., Vx"~! is 
an integer. 

As another example, Q{V5] = {a + bV5 : a, b € Q} fits the form of Eq. (9.33). 
There is no reason x in Eq. (9.33) cannot be negative, and Z[/—1] = Z[i] = 
{a+ bi: a,b € Z} is called the Gaussian integers. Z[,/—5] turns out to be an inter- 
esting ring that we will look at in Section 9.8. Finally, R[i] = C. Notice that 
Z is the subring of Z[</2] consisting of all elements of the form a + b</2 + c</4 
where b=c=0. 

For the class of rings in the next theorem, we need to have a handle on the 
units when we get to Section 9.8. It is important to work through the proof of the 
next theorem because you will need to mimic the algebraic manipulation in an 
exercise that follows. 


Theorem 9.4.3 Suppose p is a positive prime integer. Then the only units in 


ZL/—p] are +1. 
Proof. Suppose a+ b./—p is a unit. We show that b = 0 anda = +1 by supposing 
(a+ b./—p)(c+d/—p) =1 (9.34) 


and drawing conclusions about a, b, c,d. Multiplying out the terms in Eq. (9.34) 
and using the definition of equality in Z[,/— p] yields 


ac— pbd=1 and ad+bc=0 (9.35) 
Squaring each side of Eqs. (9.35) yields 
a’c* —2pabed + p*b*d* =1 and ad? + 2abcd + b’c? = 0 (9.36) 
Multiplying the second equation in (9.36) by p and adding the two equations yields 
aos pa’d” + p’b?d* + pbc? =A 
a’ (c? + pd?) + pb*(pd? +c?) =1 
(a? + pb*)(c* + pd?) =1 (9.37) 


Each factor in Eq. (9.37) is a positive integer because the components are squared 
and p is positive. Thus by Exercise 2.2.7(f), 


a+pb?>=1 and c*+pd*=1 (9.38) 
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Furthermore, since p > 2,it must be that b = d = 0,so that a = +1. 


EXERCISE 9.4.4 Using i, j,k as in the quaternion group (Example 8.1.18), 
construct the ring extension Z[i, j,k], defining equality, addition, and mul- 
tiplication, then showing that all ring properties R1-R10 are satisfied. Is it 
commutative? 


EXERCISE 9.4.5 Show that S = {a+ 0i+ cj + Ok : a,c € R} is a subring of the 
ring from Exercise 9.4.4. 


EXERCISE 9.4.6 Finding the units in a ring amounts to solving the equation 
xy = 1. In Ze, the only units are 1 and 5, so the equation xy =¢ 1 implies x, y € 
{1, 5}. Use this fact and the technique in the proof of Theorem 9.4.3 to find all 16 
units in Z6[V2]. 


EXERCISE 9.4.7 Find, with verification, all units in Z[i]. 


EXERCISE 9.4.8 Prove that the following commutative rings with unity are 
fields by showing that every nonzero element is a unit. 


(a) Qlv2] 
(b) Ql] 


Polynomial Rings 


The other type of extension we want to create might seem fundamentally different 
from the previous ones, but the principle is the same. Its one notable difference 
is that the new element we adjoin is, in a sense, more foreign to the original 
ring than numbers like /—S are to the integers. The relationship of /—5 to the 
integers is characterized by the fact that (/—5)* = —5, which is an integer, or 
if you prefer, (./—5)* + 5 = 0. Similarly, if x = /1+ 2, then (x? — 133 -2=0. 
Thus, as with /—5, there is some way to manipulate x using only the ring elements 
and operations to produce zero. The term that describes this relationship of /—5 


and V1+ 2 to the integers is algebraic, and numbers that are not algebraic are 
called transcendental. For example, z is transcendental over the integers because 
there is no way to combine z and any finite set of integers using the ring operations 
a finite number of times to produce zero. Strict definitions of these terms will come 
in your later work in algebra. For now, we simply construct an example in which 
the symbol we adjoin is transcendental over the integers because we define the 
ring and the behavior of the symbol to make it so. 

Let R be a commutative ring, and write R[t] to mean the set of all polynomials 
in the variable t, where the coefficients are elements of R. That is, 


Rit] = {ant” + apt" | +++» +aytt+taoin € W, a € R forall k, 


(9.39) 
anda, 4 Oifn # 0} 
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We insist that a, be nonzero for n 4 0 because it would be silly to suggest that 
the highest power term of an element of R[t] is a,t” when its coefficient a, wipes 
it out. However, if n = 0, we do want to allow for ag = 0, the zero polynomial. 
We address an arbitrary element of R[t] as f, as if it were a function. We will not, 
however, write it as f(t). The reason is that we are not interested in an element of 
R{t] primarily as an expression into which we substitute numeric values for r, but 
more as a string of symbols whose behavior in the ring we are constructing involves 
the symbol ¢ merely as a way to describe how elements of R[t] add and multiply.? 
First, we define f = amt” +---+ ao and g= byt" +---bo to be equal if m =n 
and ax = by for all 0 < k <n, which is clearly an equivalence relation. Concerning 
the binary operations on R[r], define addition and multiplication of two elements 
in the familiar way of adding and multiplying two polynomials. Notationally, it 
is very ugly to state the definitions of addition and multiplication formally, but 
you're certainly familiar with the way they are done. Assuming this familiarity, 
let’s check that all ring properties R1-R10 are satisfied. 

First of all, the fact that R has properties RI-R3 and R6 means that R[t] does, 
too. Letting n = 0 and ag = 0 reveals that f = 0 (viewed as a polynomial in R[f] 
and not as a mere element of R) is the additive identity (R4). The existence of 
additive inverses in R makes property RS clear. Considering the way polynomials 
multiply, properties R7—R10 call on all the similar properties of both addition 
and commutative multiplication in R. Showing these is a mess but not difficult. 
Furthermore, multiplication in R[f] is commutative because R is commutative, and 
if R has a unity element e, the polynomial e is the unity element in R[r]. Thus R[F] 
is acommutative ring and has a unity element if R does. The new ring R[t] is called 
the polynomial ring over R. 

If we were to write an element of Z[V/2] as c(\/2)? + b/2 +a, we could say that 
elements of Z[</2] are three-term polynomials in the symbol </2. The fundamental 
difference between Z[/2] and R[t] is that the three-term polynomials c(/2)? + 
bs/2 + aare all that is necessary to have closure of the ring operations when </2 
is adjoined to the integers. The behavior of </2—that is, the fact that (</2)° is an 
integer—means it is not necessary to have terms of the form (/2)" for n > 3. But 
in R[t], polynomials can be of any degree, whatever that means. 


Degree of a Polynomial 


If f € Rit] is written as f = a,t" +---+ a9, where a, is nonzero, we define the 
degree of f to be n, and we write deg f =n. This definition does not assign a 
degree to the zero polynomial, so we will not assign a degree to it. Some authors 
define deg 0 = —oo. Even though —oo is not a real number, this degree assign- 
ment can make theorems involving deg f hold for the zero polynomial. Instead of 
assigning a degree to the zero polynomial and making it a special case in theorems, 
we will understand that polynomials whose degree we are working with are always 
nonzero polynomials. In R[t], the degree of a polynomial is a measure of its size, 


3 This is a lie. But for now, it’s true. See Section 9.12. 
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like absolute value in the real numbers, even though its properties do not really 
jibe with the norm properties N1—N3 (page 54). But as we will see later, it gives 
us at least some way to apply the WOP to nonzero elements of R[t]. 


EXERCISE 9.4.9 Suppose R is a commutative rin , and let f and g be nonzero 
Pp & 
polynomials in R[r]. Then the following holds. 


(a) If deg f = deg g =n, then deg(f + g) <n. 
(b) If deg f > deg g, then deg(f + g) = deg f. 
(c) deg( fg) < deg f + deg g. 


It is easy to see why part (a) of Exercise 9.4.9 is an inequality, for if f = 217 + 1 
and g = —2/? + 4¢ are elements of Z[rt], then f + g=4r+1. But the fact that 
part (c) is an inequality instead of an equation might seem strange. The existence 
of zero divisors in R allows for this funny behavior in R[f]. For example, in Z¢[t], 


(2? + 3)(3t + 3) = 6° + 617 +94 +9 = 31 +3 (9.40) 


So the degree of a product can be strictly less than the sum of the degrees of 
the factors. Equation (9.40) also illustrates that it is possible that one polynomial 
may divide another polynomial of lower degree. These unfamiliar idiosyncracies 
can happen because of the presence of zero divisors in R. In Section 9.8, these 
behaviors will go away when we look at the polynomial ring over an integral 
domain. 


EXERCISE 9.4.10 Calculate (2/2 + 4¢ + 1) + 3t +4) in Zs[f], then in Ze[¢]. 


EXERCISE 9.4.11 Give an example of a ring R and a polynomial in R[t] that is 
a divisor of zero. 


Ideals 


In the same way a normal subgroup is a special kind of subgroup that exhibits a 
characteristic stronger than closure, we define a special class of subring where we 
have something stronger than closure of multiplication. 


Definition 9.5.1 Suppose R is a ring and / is a subring of R with the property 
that rx € I for all x e J andr € R. Then J is called a left ideal of R. Similarly, if 
xr € I for all x € J andr R, then J is called a right ideal. If I is both a left and 
right ideal, it is called a two-sided ideal. If R is commutative, there is no distinction 
between left and right ideals, and we will simply use the term ideal. Also, for 
simplicity, if R is not commutative, then we will refer simply to an ideal to denote 
an arbitrary left or right ideal. 
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An ideal, say a left ideal, has properties $1-S4, but property S4 is replaced 
with the stronger property that rx € J for all x e J andr € R. What makes a left 
ideal more than a subring is that it “absorbs” multiplication from the left, and 
similarly for a right ideal. Thus an ideal is similar to a normal subgroup in that its 
elements exhibit a certain behavior in the presence of all ring elements. Let’s list 
the defining properties of a left (or right) ideal. 


(Y1) J is closed under addition (S1). 

(Y2) I contains the additive identity (S2). 
(Y3) J is closed under additive inverses (S3). 
(Y4) Forallx € Jandre R,rx € I (orxreé 1). 


The ideal {0} is called the trivial ideal. Also, a ring is an ideal of itself. All other 
ideals besides these are called proper ideals. 


Example 9.5.2 In Zz, the set {0, 4, 8} is an ideal, for sums and additive inverses 
of multiples of 4 are also multiples of 4, and any ring element times a multiple of 
4isamultiple of 4. 1 


EXERCISE 9.5.3 Let S be the set of all polynomials in Z[t] whose constant term 
is even. Then S is an ideal in Z[t]. 


Example 9.5.4 In Example 9.2.2 we showed that mZ is a subring of the inte- 
gers. If n is any integer and mk is an element of mZ, then (mk)n = m(kn), which 
is also an element of mZ. Thus mZ is a right ideal. Since integer multiplication is 
commutative, mZ is simply anideal. 


Example 9.5.4 is a special case of a type of ideal in a ring with unity. Starting 
with any integer m, we multiply m by every integer (in this case on the right) to 
create an ideal. Similarly in an arbitrary ring, we may take any ring element and 
create a left or right ideal by multiplying it on the left or right by every element of 
the ring. 
EXERCISE 9.5.5 Suppose R is a ring and a € R. Then the set 

Ra = {ra:reé R} (9.41) 
is a left ideal in R. (By similar reasoning, aR = {ar : r € R} is a right ideal in R.) 


EXERCISE 9.5.6 Construct 3Z7 and 3Z). 


If R does not have a unity element, then the ideals Ra and aR might not 
contain a. 
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Example 9.5.7 Let E be the ring of even integers. Then 
6E = {..., —24, —12, 0, 12, 24, ...} (9.42) 


is an ideal of E that does not contain6. I 


Example 9.5.8 For the polynomial ring Z[r], (3t-+2)Z[t] is the ideal of all 
multiples of 3f + 2 in Z[t]. Since Z[t] has unity, 3r-+2 ¢ Gr+2)Z[t]. 


All the previous examples of ideals are in commutative rings. The next 
exercise illustrates an interesting possibility in a noncommutative ring. 


EXERCISE 9.5.9 Let M= iE A a,be 2}. Show that M is a right ideal in 
Z2 x2 but not a left ideal. 


A result similar to Exercise 9.2.10 holds for ideals, but we must distinguish 
between left and right ideals. 


Theorem 9.5.10 Suppose F is a family of left (or right) ideals of a ring R. Then 
Nef I is a left (or right) ideal of R. 


EXERCISE 9.5.11 Prove the left-sided case of Theorem 9.5.10. 


In a commutative ring, where there is no difference between left and right 
ideals, Exercise 9.5.11 says that the intersection of a family of ideals is an ideal. In 
the next section, you will show that the intersection of a left ideal and a right ideal 
need not be either a left or right ideal. 


EXERCISE 9.5.12 Demonstrate a ring R and two ideals J; and Jy such that 
I, U bl is not an ideal in R. 


Although the union of two ideals is not necessarily an ideal, there is a theorem 
that will come in handy in Section 9.9 that says something about the union across 
a special family of ideals. 
Theorem 9.5.13 Suppose {J,}°°, is a family of left (or right) ideals of a ring 
with the property that J, C J,41 for all n. Then U™ (J, is a left (or right) 
ideal. 


EXERCISE 9.5.14 Prove the left-sided case of Theorem 9.5.13. 
EXERCISE 9.5.15 Let R be a commutative ring and Z the set of all zero 


divisors in R. What is wrong with the following proof that Z U {0} is an ideal 
in R? 
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Proof. Suppose R is a commutative ring and Z is the set of the zero divisors in R. 


(Y1) 


(Y2) 
(Y3) 


(Y4) 


Since Z U {0} satisfies properties Y1-Y4, Z U {0} is an ideal in R. 


Let z1, z2 € ZU {0}. If z1 = z2 = 0, then clearly z1 +z2 =O € ZU {0}. If 
precisely one of z1, z2 is zero, then without loss of generality, z; = 0 and 
z2 #0. Then zz is a zero divisor, so there exists nonzero a € R such that 
zja=0. Thus (z} + z2)a = za=O0, so that zj +22 is a zero divisor. If 
neither z; nor zz is zero, then there exist nonzero a1, a2 € R such that 
Za, = Z2a2 = 0. Thus 


(z1 + Z2)a1a2 = Z1414a2 + z2a2a1 = Oa2 + 0a = 0 
so that z; + z2 is a zero divisor. In any case z, + z2 € ZU {0}, so that Z U {0} 
is closed under addition. 
By definition, 0 € Z U {0}. 
Let ze ZU {0}. If z = 0, then —z = 0 € ZU {0}. If z £0, then there exists 


nonzero a € R such that za = 0. Thus (—z)(a) = —za = —0 = 0, so that —z 
is a zero divisor. In either case —z € Z U {0}. 


Let ze ZU {0}, andre R. If z=0, then rz =0e€ ZU {0}. If z £0, then 
there exists nonzero a é€ R such that za=0. Since a(rz) = r(az) = 0, it 
follows that rz is a zero divisor. 


EXERCISE 9.5.16 Demonstrate the error in the proof from the previous 
exercise by using appropriate elements from Zi. 


Generated Ideals 


Analogous to the subgroup generated by a subset of a group, we can define a 
similar term for ideals. 


Definition 9.6.1 Suppose R is a ring and A is anon-empty subset of R. Suppose 
Tis a subset of R with the following properties. 


(U1) ACI. 
(U2) J isa left ideal of R. 
(U3) If Jisaleft ideal of Rand A C J,then/ C J. 


Then / is called a left ideal generated by A and is denoted (A). 


We can similarly define a right ideal generated by A and denote it (A),. If R is 
commutative, we simply write (A) and call it the ideal generated by A. Does (A); 
exist? If so, is it unique? And if so, what does it look like? 
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EXERCISE 9.6.2 Suppose R is a ring and A is a non-empty subset of R. Let F 
be the family of all left ideals of R that contain all elements of A. Then (A), exists 
uniquely and can be written as 


(A); = () I (9.43) 
leF 


If our path through groups and subgroups suggests a direction for us to go 
from here, we would consider the left ideal generated by a single ring element a 
and show that the top-down form of (a); in Exercise 9.6.2 is equivalent to a form 
that can be built from the bottom up, by starting with a and building up to a subset 
of R that has properties U1—-U3. Let’s do that now. But in order for this program 
to be sure to work, R must have a unity element. 


Theorem 9.6.3 Suppose R is a ring with unity and a € R. Then (a); = Ra, the 
construction in Eq. (9.41), and (a), = aR. 


Proof. We prove for (a); only by showing that Ra satisfies properties U1—-U3. The 
proof for (a), is similar. First, since R has a unity e,a = ea € Ra, so that U1 is 
satisfied. Second, by Exercise 9.5.5, Ra is an ideal of R, so that U2 is satisfied. 
Finally, suppose J is any left ideal of R that contains a, and pick any x € Ra. 
Then x = ra for some r € R. But since J is an ideal of R anda € J, it must be that 
ra € J,so that Ra C J. Thus U3 is satisfied, and (a); = Ra. 


Example 9.6.4 Describe the left ideal of Z2,.2 generated by E Al 


Solution Let ke | be any matrix in Z2,2. Then 


d 
k 4 E 3| = B =| (9.44) 


Thus the left ideal is the set of all 2 x 2 matrices whose first column entries are 
even and whose second column entries are multiples of three. 


EXERCISE 9.6.5 Describe the right ideal of Z2,2 generated by the matrix in 
Example 9.6.4. 


Regardless of whether the presence of a unity allows (a); to be written in the 
form Ra, the left ideal generated by a single element a € R is called the principal 
left ideal generated by a. If R does not have a unity element, then Ra will not 
contain a. Thus the construction of (a) would be a bit more complicated than 
that in Theorem 9.6.3. In this text, (a) will always be in the context of a ring 
with unity. 
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EXERCISE 9.6.6 In the integers, what is (6) N (15)? 


EXERCISE 9.6.7. In Z[r], describe (1). 


EXERCISE 9.6.8 Find the principal left and right ideals generated by F i 


in Zo.2. 


a 0 


EXERCISE 9.6.9 LetM = 1E :a,b,ce€ 2}. Letr = E 3 pane consider 


the following subsets of M. 


rM = {rx: x € M} (9.45) 
Mr = {xr : x € M} (9.46) 


(a) Show that M is a ring by showing it is a subring of Z2,2. 


(b) By Exercise 9.5.5, the sets in Eqs. (9.45) and (9.46) are ideals in M. Show that 
neither is a subset of the other. 


(c) Show that rM N Mr is neither a left nor a right ideal in M. 


EXERCISE 9.6.10 Suppose R is a ring with unity, and suppose a is a unit of R. 
Show that (a); = (@),; = R. 


We can construct a form analogous to Eq. (9.41) for the ideal generated by 
a finite set {a1,a2,...,an}. To simplify the notation, we will denote this ideal 
(a1,.--,dn)- 


EXERCISE 9.6.11 Suppose R is a ring with unity and A = {aj,a2,..., a} is a 
subset of R. Construct a form of (A); that is analogous to Eq. (9.41) and show that 
it satisfies U1-U3. 


EXERCISE 9.6.12 Show that the ideal defined in Exercise 9.5.3 is actually (2, f). 


EXERCISE 9.6.13 Leta and b be nonzero integers, and let g = gcd(a, b). Show 
that (a, b) = (g). 


EXERCISE 9.6.14 Let U bea non-empty set, and consider the ring on P(U) 
with symmetric difference and intersection (Exercise 9.1.8). Let A be a subset of 
U. Describe the ideal generated by A. 


If R is a commutative ring with unity, there is an important link between the 
existence of proper ideals and the kind of ring R is. We will prove one direction of 
the next theorem, and you will prove the other. 


Theorem 9.6.15 Suppose R is a commutative ring with unity. Then R is a field 
if and only if it has no proper ideals. 
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(<) Suppose R has no proper ideals. Choose any nonzero x € R. We show that x 
has a multiplicative inverse. Since x is nonzero and R has no proper ideals, 
(x) = R, so that e € (x). By Theorem 9.6.3, (x) = Rx, so that there exists 
r € Rsuch that e = rx. Thus x has a multiplicative inverse, and R is therefore 
a field. 


EXERCISE 9.6.16 Prove the => direction of Theorem 9.6.15. 


Suppose / is a proper ideal of a ring R anda eé R — I. Similar to the way we 
adjoin an element to a ring to create an extension, we can construct a left or right 
ideal of R that contains a and all elements of 7. Our next result addresses the 
construction for the left-sided case. We will need this construction in Section 9.15 
in the context of a commutative ring with unity. 


EXERCISE 9.6.17 Let Rbea ring, J an ideal of R, and fix a € R— J. Let 
J={rat+i:reR,ie Tl} (9.47) 


Then J is a left ideal of R. 


Prime and Maximal Ideals 


If p is a prime number and p | ab, then either p|a or p | b (Exercise 2.5.11). 
Another way to say this is if there exists integer k; such that pk; = ab, then there 
will exist integer ky such that either pkz = a or pk2 = b. Using the language of 
the ideal in the integers generated by p and Theorem 9.6.3, yet another way to 
say this is if ab € (p), then either a € (p) or b € (p). Ina general ring, we assign 
a term to any proper ideal with this special property, whether or not the ideal is 
principal. 


Definition 9.7.1 Suppose R is a ring, and P is a proper ideal of R with the 
property that ab € P implies either a € P or be P. Then P is called a prime 
ideal of R. 


7. etermine with proof whether each of the following ideals is 
EXERCISE 9.7.2 D i ith proof wheth h of the following ideals i 
prime. 

(a) (6) and (7) inZ 

(b) (2,1) in Z[4] 

(c) @ in Ztq 

(d) The left ideal from Exercise 9.6.8 
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Even though we have not yet looked into properties of prime elements in a 
ring, we must be careful about jumping to conclusions about prime ideals and 
principal ideals generated by prime elements. In Section 9.8, we will see that if 
a principal ideal is a prime ideal, then its generator must be a prime element. 
However, just because an element is prime, it does not mean that it generates a 
prime ideal. We will illustrate an example of how this can happen in the next 
section. In Section 9.10, we will see that a prime element in a more special- 
ized kind of ring called a principal ideal domain will always generate a prime 
ideal. 

Any proper ideal of a ring is contained in a larger ideal, namely, the ring 
itself. But if an ideal is as large as it can be without actually being the entire ring, 
then we call it maximal. Since we are primarily interested in maximal ideals in a 
commutative ring, we set the definition in that context. 


Definition 9.7.3 Suppose R is a commutative ring, and / is a proper ideal with 
the property that if J is any ideal such that 7 C J C R, theneither J = Jor J = R. 
Then J is called a maximal ideal. 


One way to visualize a maximal ideal M is the following. If M is a maximal 
ideal, then for any r € R — M, the only ideal of R that contains r and all elements 
of M is R itself. 


EXERCISE 9.7.4 If p is a prime number, then the ideal it generates in the 
integers is maximal. 


EXERCISE 9.7.5 Explain why (4) is not maximal in the integers. 


2a 2b 


EXERCISE 9.7.6 In Z,2, the right ideal J = (3: nd 


]:aaeaez} is not 


2a 2b 


maximal, for J = {| Z 7 :a,b,c,dé€ z} is also a right ideal. 


EXERCISE 9.7.7 In the power set ring (Exercise 9.1.8), let Ag C U. Then 
P(U — Ao) is a maximal ideal if and only if Ao contains precisely one element. 


We see immediately that there is a relationship between prime and maximal 
ideals in commutative rings with unity. 


EXERCISE 9.7.8 If M is a maximal ideal in a commutative ring with unity, then 
M is prime. 


The reason that the ring in Exercise 9.7.8 has to have a unity element can be 
seen in the next exercise. 
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EXERCISE 9.7.9 Show that (4) is maximal but not prime in the ring of even 
integers. 


About now you should be asking either for a theorem claiming that prime 
ideals in commutative rings with unity are also maximal (so that the terms are 
logically equivalent) or for an example of a prime ideal that is not maximal. Well, 
they are not logically equivalent. 


EXERCISE 9.7.10 Demonstrate an example of a prime ideal that is not 
maximal.4 


When we restrict ourselves to principal ideal domains in Section 9.10, we will see 
that prime ideals are also maximal. 


EXERCISE 9.7.11 In Zx Z, is (5) x (3) prime and/or maximal? Verify your 
answer. 


EXERCISE 9.7.12 For non-empty set U and Ag C U, show that the ideal 
P(U — Ao) is prime if and only if Ao has precisely one element. 


Integral Domains 


A commutative ring with unity that has no zero divisors is called an integral 
domain, or domain for short. The absence of zero divisors means that the prin- 
ciple of zero products applies by definition. Thus the integers are a standard 
first example of a domain. Since all nonzero elements of a field are units and 
a unit is not a zero divisor (Exercise 9.3.16), a field contains no zero divisors 
and is therefore a domain. Thus the rationals, reals, and complex numbers are all 
domains. 


EXERCISE 9.8.1 If p is a prime number, then Z, isa domain.° 


Example 9.8.2 Z[V2] and Qop are domains, for as subrings of the real 
numbers, they are commutative, contain 1, and have no zero divisors. 


Example 9.8.3 If U is aset with more than one element, then the power set ring 
from Exercise 9.1.8 is not an integral domain, for every non-empty, proper subset 


of U is a zero divisor (Exercise 9.3.15). 1 


EXERCISE 9.8.4 If Disa domain, and if ac = bc andc £0, thena = b. 


4 You shouldn’t have to look far. 
5 See Exercise 2.5.11. 
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EXERCISE 9.8.5 Suppose D is a domain and a and b are domain elements such 
that a | b. Then the element & such that ak = b is unique. 


With Exercise 9.8.5, the following term becomes meaningful. 


Definition 9.8.6 Suppose D is a domain, a, b € D, and a is not a unit. If a | b, 
where ak = b and k is not a unit, then a is called a proper divisor of b. 


In a ring in which the result of Exercise 9.8.5 does not apply, such as those in 
Exercise 9.3.10, then Definition 9.8.6 cannot be unambiguously applied. For it is 
possible to have ak, = band ak2 = bin aring R where k, is a unit in R but kp is not. 


Example 9.8.7 In Zox2, 


adfo =f al-f la i} ow 


Now F i is a unit in Z>,2, for 
1 i}]1 -1 1 -1]}1 1 1 0 
lo i E let i E lee Hl a) 
; 2 1}. ays 
while E i isnotaunitinZ,.. 


EXERCISE 9.8.8 Show that E i is not a unit in Zo,2. 


Multiplicative cancellation in a domain is a logical consequence of the principle 
of zero products. We could have defined a domain as a commutative ring with unity 
where multiplicative cancellation holds for nonzero elements, and we could have 
then shown that the principle of zero products follows from that. In a commutative 
ring, the principle of zero products and multiplicative cancellation are logically 
equivalent. 


EXERCISE 9.8.9 Acommutative ring with unity and multiplicative cancellation 
is a domain. 


If a commutative ring with unity has a nonzero characteristic, then the only 
way it can be a domain is if the characteristic is prime. 


EXERCISE 9.8.10 If a domain has nonzero characteristic n, then n is prime. 


With Theorem 9.3.25 and Exercises 9.8.1 and 9.8.10, we see that Z,, is a domain 
if and only if n is prime. The only feature a field has that a domain might not is the 
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existence of multiplicative inverses. Thus there is only one thing to show in the 
next exercise. 


EXERCISE 9.8.11 A finite integral domain is a field.® 


By Exercise 9.8.11, we have that Z, is a field if and only if p is prime. 


EXERCISE 9.8.12 Find multiplicative inverses for all nonzero elements of Z7. 


In the integers, you showed in Exercise 2.5.5 that ifa | bandb | a,thena = +b. 
Furthermore, +1 are the units in the integers. In a general domain, we say that 
a and b are associates if a| b and b | a. Since this definition does not apply to 
zero, we declare zero to be an associate of itself. Notice that this declaration 
and the definition of associate prevent zero from having any other associates in 
a domain. 


EXERCISE 9.8.13 Show that 1 +i and 1—i are associates in the Gaussian 
integers Z[i]. 


EXERCISE 9.8.14 Ina domain, two elements a and b are associates if and only 
if there exist units uw and v such that a = ub and b = va. 


You should feel an equivalence relation coming on about now. 


EXERCISE 9.8.15 Let Dbea domain, and define a ~ b if a is an associate of b. 
Then ~ is an equivalence relation on D.’ 


Anytime you create an equivalence relation, it is natural to ask two ques- 
tions: What do the equivalence classes look like, and which element from each 
equivalence class might be a good choice as a representative element of the class? 


EXERCISE 9.8.16 If D is a domain and ~ is the equivalence relation of 
association, then [e] is the set of units of D. 


If a and b are associates, then there ought to be some senses in which they 
are interchangeable. One example of how this is true is in the integers, where 
(6) = (—6). Associates generate the same principal ideal. The best way to show 
this is first to prove the following. Its corollary is immediate. 


EXERCISE 9.8.17 Suppose a and b are elements of a domain. Then a | b if and 
only if (b) € (a). 


© To find a~!, define f(x) = ax and apply Exercise 4.6.11. 
7 Don’t forget zero. 
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Corollary 9.8.18 Ina domain, (a) = (0) if and only if a and b are associates. 


If ais a proper divisor of b, then by Exercises 9.8.5 and 9.8.14, a and b are not 
associates. So (b) € (a) by Exercise 9.8.17, but (a) 4 (b) by Corollary 9.8.18. Thus 
we have the following. 


Corollary 9.8.19 Ifa is a proper divisor of b in a domain, then (b) is a proper 
subset of (a). 


Now let’s look at the relationship between prime elements and prime ideals. 


EXERCISE 9.8.20 If Disadomainand (p) is a prime ideal, then p is prime in D. 


But what about the converse? If p is a prime element of a domain, must it 
generate a prime ideal? In the integers the answer is yes. Thanks to Exercise 2.5.11, 
if p is a prime number, then (p) is a prime ideal in the integers. But in the integers, 
Exercise 2.5.11 depends on the existence of greatest common divisors, which itself 
depends on the WOP. Strangely, in some domains a prime element can generate an 
ideal that is not prime, and a prime element p might satisfy p | ab, while p divides 
neither a nor b. The next exercise illustrates these possibilities. We provide the 
solution to the first part and leave the rest to you. We will return to this example 
in Section 9.9. 


EXERCISE 9.8.21 The ring Z[./—5] is a domain because it is a subring of the 
complex numbers and contains 1. However, you can show the following. 


(a) 3 is prime in Z[/—5]. 
Solution First note 
9=3-3= (2+ 7-52 - V-5) (9.50) 
To show 3 is prime in Z[./—5], suppose 
3 = (at bV—5)\(c + dV—5) (9.51) 
where a, b,c, d are integers. We show that one of these two factors must be 


a unit, which by Theorem 9.4.3 must be +1. Multiplying out the factors and 
using the definition of equality in Z[/—5] yields 


ac—5bd=3 and ad+bc=0 (9.52) 


Proceeding as in the proof of Theorem 9.4.3, we square each equation, mul- 
tiply the latter by 5, add and factor to have 


(a* + 5b*)(c? + 5d”) =9 (9.53) 
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Since both factors in Eq. (9.53) are positive integers, both must be in the set 
{1, 3, 9}. Considering each possibility reveals either a contradiction or the fact 
that one of the factors is a unit. Thus 3 is prime in Z[V—5]. 


(b) 2+ /—5S are prime in Z[/—5]. 
(c) 24+ /—5 ¢ (3). 


(d) There exists a prime element p € Z[./—5] such that (p) is not a prime ideal. 


(e) There exist a, b, p € Z[/—5] where p is prime, p | ab, but p divides neither 
anor b. 


In the polynomial ring over a domain, the degree of a product of polynomials 
behaves in a more predictable way than it might in the polynomial ring over an 


arbitrary ring. 


EXERCISE 9.8.22 Suppose D is a domain and f g € D[t] are nonzero polyno- 
mials. Then 


(a) deg( fg) = deg f + deg g. 
(b) If f | g, then deg f < degg. 


EXERCISE 9.8.23 The polynomial ring over a domain is a domain. 


The next exercise will help you see more clearly what it means for a polynomial 
in D[t] to be prime. 


EXERCISE 9.8.24 Let D be a domain whose set of units is U, and let K bea 
field. 


(a) What are the units of D[t]? 


(b) What are the units in K[t]? 


In a polynomial ring, we generally use the word irreducible instead of prime to 
refer to such a polynomial. 


EXERCISE 9.8.25 Show that f =?” —2is irreducible in Z[r]. Let Z[V2, t] rep- 
resent the polynomial ring over the domain Z[/2]. Show that f is reducible 
in Z[V2, ft]. 


EXERCISE 9.8.26 Is f = 20 — 4 reducible in Z[t]? In Q[r]? 


If f =a,t" +---+ ao is a polynomial in Z[rt] such that deg f > 1, we call 
gcd(ay, dn—1,--., a0) the content of f. If the content of f is one, we say that f is 
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primitive. Insight from Exercise 9.8.26 should make your proof of the following 
immediate. 


EXERCISE 9.8.27 Suppose f € Z[t] is irreducible and deg f > 1. Then f is 
primitive. 


Exercises 9.8.26 and 9.8.24 reveal that a result similar to Exercise 9.8.27 does 
not apply to Q[t]. That is, if a polynomial with integer coefficients is irreducible 
when viewed as an element Z/[t], then it is primitive. However, 2t + 6 is irreducible 
in Q[t] but has content 2. 


EXERCISE 9.8.28 Suppose R is aring and letr be anonzero element. Define f : 
R— Rby f(x) = rx. Show that f is not necessarily one-to-one. What additional 
condition on R guarantees f is one-to-one? Prove. 


EXERCISE 9.8.29 Suppose R and S are both domains. Does it follow that R x $ 
is a domain? 


Unique Factorization Domains 


Theorem 3.5.19 states that every integer greater than 1 has a unique factorization 
into positive integer primes. If we broaden the context to the integers, we lose 
uniqueness in most cases because we could introduce some negative signs here 
and there. But this is the only way we lose uniqueness. Even then, the prime 
factors in two different factorizations can be paired up as associates of each other 
(additive inverses), allowing us to say that every integer greater than 1 has a prime 
factorization in the integers that is unique up to order and association of the factors. 
For a negative integer n, applying the same principle to —n allows us to say that 
every nonzero integer that is not a unit has a factorization into prime integers, and 
this factorization is unique up to order and association of the factors. An integral 
domain in which every nonzero element has a factorization into prime elements, 
unique up to order and association of the factors, is called a unique factorization 
domain, or UFD for short. In a field every nonzero element is a unit, so a field is 
trivially a UFD. 

In our progression from more general to more specialized rings, UFDs are the 
point where we can show that any two nonzero elements have a greatest common 
divisor, unique up to association. We will adapt Definition 2.5.6 slightly to make 
it applicable to a general domain and then take a quick look at how we show 
existence and some sort of uniqueness of the greatest common divisor in a UFD. 
The first place we actually need the existence of the gcd is in a principal ideal 
domain (PID), where it is pretty easy to show. 

An important example of a UFD that we merely make reference to right now 
is Z[t]. To verify that Z[r] is a UFD takes some mathematical machinery that we 
will create in Section 9.11, where we will study Q[t] in some depth. Even though 
Zt] is a subring of Q[t], it is not as specialized a ring because there are limitations 
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on coefficients in Z[r] that do not apply in Q[t]. To show that Z[t] is a UFD, it helps 
to view it in the context of Q[r]. 

Since a UFD is by definition an integral domain, we do not need a theorem 
claiming that a UFD is a domain. But not all domains are UFDs. For example, 
Z[/—5] is a domain, and Eq. (9.50) shows that 9 has two distinct factorizations 
into prime elements. Thus Z[/—5] is not a UFD. 


Definition 9.9.1 Suppose D is a domain, and let a and b be nonzero elements 
of D. Suppose g is a domain element with the following characteristics: 


(D1) g|aandg|b. 


(D2) Ifh is any element of D with the properties that A | a and h | b, then it must 
be that h | g also. 


Then g is called a greatest common divisor of a and b and is denoted gcd(a, b). 


In Section 2.5, we said that a practical way you might find gcd(a, b) in the 
positive integers is by breaking down a and b into their prime factorizations, taking 
the appropriate number of 2s, 3s, and so on, and building the gcd from that. 
We did not prove that such a trick produces a positive integer with properties 
D1-D2 because the WOP provided an easier and more useful way. Alas, in a 
general UFD, the existence of gcd(a, b), unique up to association, must be proved 
by exploiting the unique prime factorizations of a and b in that somewhat sloppy 
way. We state the theorem here, followed by some details of the proof that might 
make the notation minimally sloppy. You will write the complete proof as an 
exercise. 


Theorem 9.9.2 Suppose D is a UFD and a and b are nonzero elements of D. 
Then there exists g € D that satisfies D1—D2, and if g; and g2 both satisfy D1-D2, 
then g; and g» are associates. 


Here are some suggestions on how to prove Theorem 9.9.2 and keep the nota- 
tion from getting outrageously complicated. If we break a and b down into prime 
factorizations, then let {p1, p2,..., Pn} be all the primes that appear in either a 
or b, we can write 


ry 2 


a= p;'P>.-. py" and b = ph! ph... phn (9.54) 


where some of the a, and 6; might be zero. If we let yy = min{ax, Bx} for all 
1<k <n, you can show that g = p}'p}’... pn" satisfies D1-D2. Since y= 
min{az, 6x}, we know that yz < az and yg < Bx for all k. This should make it easy 
to show that g has property D1. To show that g has property D2, suppose h | a 
and h | b. Then there exist ki, ky € Dsuch that hk, = a and hk2 = b. If the unique 
factorization of h is written as h = qi'qy os gn", then the prime factorizations of 
hk, and hk» must agree with Eqs. (9.54), so that some possible reordering of the px; 
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allows us to say that g, = p; foralll <k <m <n.Ifm <n, then letting 5; = 0 for 
m+1<k<ngivesush= p'! ae pe. But the fact that h | a implies that 5; < ax 
for all1 < k <n. Similarly, h | bimplies that 5; < 6; for all1 <k <n. Thus 5; < yx 
for all 1 < k <n, and by constructing the appropriate element of D, you can show 
h | g. Thus g has property D2. Showing g is unique up to association is surprisingly 
easy. In fact, the way you proved uniqueness of gcd(a, b) in the positive should 
translate directly over to D to imply that any g; and go that satisfy D1—D2 are 
associates. 


EXERCISE 9.9.3. Prove Theorem 9.9.2. 


Principal Ideal Domains 


Since an integral domain is commutative and has a unity element, left and 
right ideals are indistinguishable, and the principal ideal generated by an ele- 
ment a can always be written as Da. Furthermore, by Exercise 9.6.11, if A = 
{aj,..., An}, then 


(A) = {djay + dpa + +++ +dyan: dy € Dfor alll < k <n} (9.55) 


Let’s write this form of the ideal generated by A as Day + Daz + ---+ Dan. 

In some domains, principal ideals are the only ones there are. Every ideal 
will have a single generator. If a domain is such that every ideal is principal, it is 
called a principal ideal domain, or a PID for short. A field is trivially a PID, for 
Exercise 9.6.16 says that a field has no proper ideals. The trivial ideal in a field is 
generated by zero, and the field itself can be generated by any nonzero element. 

You have already seen an example of an integral domain that is not a PID, 
but before we point it out, let’s see why some of the domains we already know 
are PIDs, and derive some results for PIDs in general. Then the example of a 
domain (actually a UFD) that is not a PID will point itself out. First let’s review 
the integers a bit. 

In Chapter 2, after we defined divisibility and gcd in the integers, we showed 
that the gcd exists uniquely for all nonzero integer pairs (Theorem 2.5.7). 
Furthermore, it is the smallest positive element of 


S={ma+nb:m,ne€Z} (9.56) 


Thus our proof that gcds exist in the integers depended on the WOP to give us a 
number that we could show satisfies properties D1—D2. Furthermore, in showing 
that this smallest element of S divides both a and b, you employed the division 
algorithm. And you proved the division algorithm with the help of the WOP. 
Notice that S in Eq. (9.56) is precisely Za + Zb, the ideal generated by {a, b}. 
So gcd(a, b) is the smallest element of (a, b). Moreover, in Exercise 9.6.13, you 
showed that {a, b} and gcd(a, b) generate the same ideal in the integers, so that 
the ideal generated by a two-element set is really principal after all. Once again, it 
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is the WOP at work, leading you to the smallest positive element of an ideal, and 
allowing you to show that it generates the whole ideal. By looking in this same 
direction, you can show that the integers are a PID. The way you will attack this 
in the next exercise deserves a comment. 

To show that any particular domain D is a PID, you must show that every 
ideal can be written as Da, where a is a generator of the ideal that you must find. 
For the trivial ideal, that’s a piece of cake; otherwise, the ideal has a positive 
element. So the WOP comes in and gives you a generator on a silver platter 
(details notwithstanding). Thus the WOP is the basis for the integers being a PID 
and for its containing gcds. But the WOP depends on < as a measure of size of 
elements in the positive integers. 


EXERCISE 9.10.1 The integers are a PID. 


Another example of a PID is Q[t]. Rather than show this directly, we will 
show in Section 9.11 that Q[t] is a Euclidean domain and that all Euclidean domains 
are PIDs. 

A creative use of the WOP can supply you with a generator to prove the 
following. 


EXERCISE 9.10.2 Show that Qop is a PID. See the hint if you need help 
finding a generator.® 


Now suppose we’re working in a domain in which we don’t necessarily 
have a measure of element size, so that we have no way to apply the WOP. 
With the assumption that all ideals are principal, you can prove that gcds exist 
ina PID. 


EXERCISE 9.10.3 Suppose D is a PID and a and b are nonzero elements of D. 
Then gced(a, b) exists.” 


Since a generator for (a, b) qualifies as gcd(a, b) and since (a, b) = Da + Db, we 
have the following. 


Corollary 9.10.4 If Dis a PID and a and D are nonzero elements of D, then 
there exists m,n € D such that gcd(a, b) = ma+nb. 


Unfortunately, Exercise 9.10.3 says nothing about the uniqueness of gcd(a, b). 
But there is something close. 


EXERCISE 9.10.5 If gi and go are both greatest common divisors of a and b in 
a PID, then g, and go are associates. 


8 Of all nonzero elements of an ideal, pick one with a smallest power of 2 in the numerator. 
9 Let g be a generator of the ideal generated by a and b. 
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So even if there are several gcds of a and bd, they are all unit multiples of 
each other. If g is one gcd of a and b, then everything in its equivalence class of 
associates is, too. If [e] is the set of gcds of a and b, then we say a and bare relatively 
prime, and we note that if a and b are relatively prime, then there exists a linear 
combination such that ma + nb = e. 

Now let’s find that example of a domain (actually, a UFD) that is not a PID. In 
any commutative ring, a maximal ideal is prime (Exercise 9.7.8). But some prime 
ideals are not maximal, such as (f) in Z[t] (Exercise 9.7.10). And (#) is a proper 
subset of (2, f), the ideal from Exercise 9.5.3 of all polynomials with even constant 
term. The following result lets us close in for the kill. 


EXERCISE 9.10.6 A prime ideal in a PID is maximal. 


Corollary 9.10.7 Z[t] is not a PID. 


Proof. (2, t) is prime but not maximal in Z[¢]. 
In Section 9.11, we will show that Z[r] is a UFD. 


EXERCISE 9.10.8 Show that (2, /)isnota principal ideal of Z[t] by showing that 
every element fails to generate the ideal. 


Now we want to show that a PID is a UFD. It takes several steps to get there, 
but fortunately a lot of the uniqueness work has already been done for the posi- 
tive integers in the proof of Theorem 3.5.19 and translates over to a PID almost 
word for word. The sticky part in showing the existence of prime factorizations 
in a PID stems from the fact that the defining characteristic of a PID concerns 
its ideals, while the defining characteristic of a UFD concerns its elements. The 
link between them is the following. All ideals in a PID have a generator, and 
the way these ideals contain each other as subsets is tied by Exercise 9.8.17 to 
divisibility of their generators. Since in a PID a| b if and only if (a) 2 (bd), the 
way an element breaks down into factors is directly linked to the way principal 
ideals stack. So let’s make an important observation about the way ideals in a PID 
cannot stack. 

Let D be a PID and {J} , a family of ideals such that J, C J,+1 for all n. By 
Theorem 9.5.13, J = U%, J, is an ideal. Since D is a PID, / is principal and has 
a generator a. Now since a € UP. , In, a is in some J,. We claim that for all k > n, 
Ik = In. Suppose that k > n, and let x be any element of /,. Then x € J, so that 
x = ay for some y € D. But since /, is an ideal that contains a, then ay € J,. Thus 
I, CI,,so that i, = I,. 

The upshot of all this is that if you are in a PID and have a set of hypothesis 
conditions that allows you to create a family of ideals {/,}°°, where I, is a proper 
subsest of J,+1 for all, then you have produced a contradiction, and at least some 
of the hypothesis conditions are false in a PID. 

Now let’s show that every element of a PID can be written as a product of 
prime elements and that this factorization is unique up to order and association 
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of the factors. To prove the former, suppose a PID contains some element a that 
cannot be written as a product of primes. Then a is not prime itself. So there exist 
domain elements a, and b; such that a = a,b; and neither a, nor by is a unit. 
Furthermore, since a cannot be written as a product of primes, then a, or b; (or 
both) cannot either. Without loss of generality, we may assume aq is the culprit, 
which means that aj is not prime. And notice, since bj is not a unit, (a) is a proper 
subset of (a1) by Corollary 9.8.19. 

Now since a is not prime, there exist az and b2 such that a; = azb2 and neither 
ay Nor bz is a unit. Once again, since a; cannot be written as a product of primes, 
either a2 or bz cannot either. Assume it’s a2, and note that (a1) is a proper subset 
of (a2) because b2 is not a unit. Continuing in the same way, we can generate an 
infinite set of ideals {(a1), (a2), ...} where each (a,) is a proper subset of (ay+1). 
This is a contradiction, so the assumption that there exists an element that cannot 
be written as a product of primes is false, and all elements have a factorization 
into primes. We have therefore proved the following. 


Theorem 9.10.9 Every nonzero, non-unit element of a PID can be written as 
a product of primes. 


To show the uniqueness of the prime factorization, almost all the work trans- 
lates directly over from our work in the integers. The first step is analogous to 
Theorem 2.5.10, but the proof comes out a little differently because primes in a 
PID are defined in language that differs from that in the positive integers. 


Theorem 9.10.10 If Disa PID, a is nonzero, and p is prime, then either p | a 
or p and a are relatively prime. 


Proof. Let g = gced(a, p). Since g | p and p is prime, then either g is a unit or it 
is an associate of p. If g is a unit, then writing am + np = g and multiplying both 
sides through by g~! reveals that a and p are relatively prime. If g is an associate 
of p,then gu = p for some unit u,so that g = pu—!. Also, the fact that g | a means 
that pu-! | a,so that p | a. 


With Theorem 9.10.10 in hand, the proofs of the next two theorems would be 
identical to those for Exercises 2.5.11 and 3.4.19. 


Theorem 9.10.11 If Disa PID anda, b, p € Dare such that p is prime, a and 
b are nonzero and p | ab, then either p | aor p| b. 


Theorem 9.10.12 Suppose D is a PID, and p, aj, a2, ..., ad, € D are such that 
p is prime and aq #0 for all 1 <k <n. Thenif p | aja2...ay, then there exists 
k(1 < k <n) such that p | ag. 


The uniqueness result is analogous to Theorem 3.5.19, but the proof does not 
come out exactly the same because of the possible association of the factors. The 
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next exercise will come in handy when you write the proof in the exercise that 
follows. 


EXERCISE 9.10.13 If Disa domain, p € D is prime, and u € Dis a unit, then 
pu is also prime.!® 


EXERCISE 9.10.14 If D is a PID and a is nonzero and not a unit, then the 
prime factorization from Theorem 9.10.9 is unique up to order and association of 
the factors. 


With Theorem 9.10.9 and Exercise 9.10.14, we have the following. 
Theorem 9.10.15 A PID is a UFD. 
Finally, as a direct result of Theorem 9.10.11, we have the following. 


EXERCISE 9.10.16 If Dis PID and p is a prime element, then (p) is a prime 
ideal. 


Euclidean Domains 


Let’s return to the division algorithm (Theorem 2.4.7) to expand it and cast it in 
somewhat different language. In writing b = aq + r, we insisted that a be positive, 
so that 0 <r <a is meaningful. We did not have to make this restriction on a. 
Obviously, a must be nonzero, but we could have allowed a to be negative to have 
a slightly stronger theorem. It would look something like this. 


Theorem 9.11.1 (Extended Division Algorithm). Suppose a and b are non- 
zero integers. Then there exist unique integers g and r such that b = aq+r and 
0<r<|al. 


Proving to existence if a is negative is easy by appealing to existence for —a. 
In Theorem 9.11.1, we are using absolute value as a measure of the size of inte- 
gers and are saying that any pair of nonzero integers can be related by breaking 
one of them (b) down into a certain multiple (q) of the other (a), with some possi- 
ble stuff left over (7), but where the stuff left over is smaller in size than a. There 
are other important integral domains in which a notion of size can be imposed on 
the elements, and something akin to the division algorithm using that measure of 
size works for all nonzero elements. Since we do not need to apply the division 
algorithm to the zero element, we do not insist that zero be assigned a measure 
of size. 


10 Tf pu = abisa factorization of pu, then a(bu—) is a factorization of D- 
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Definition 9.11.2 Suppose D is an integral domain, and suppose there exists a 
function d : D* — W with the property that d(a) < d(ab) for all nonzero a and b. 
Such a function is called a valuation. Suppose also that for all nonzero elements 
aand b, there exist g,r € D,such that b = ag +r, and either r = 0 or d(r) < d(a). 
Then D is called a Euclidean domain. 


Demonstrating that a domain is Euclidean requires the creation of a valuation 
that assigns a nonnegative integer size to all nonzero elements of the domain. 
There might be more than one such valuation, but below we will point out some 
features a valuation must have as a result of the requirement that d(a) < d(ab). 
Also, given two nonzero elements a and b, b must either be a multiple of a, in which 
case r = 0, or be just the right distance from a multiple of a so that b = aq+rcan 
be written with r of sufficiently small size. 


Example 9.11.3 The integers are Euclidean, since absolute value may serve as 
a valuation in Theorem 9.11.1. 1 


Example 9.11.4 A field is a Euclidean domain. Let d(a) = 1 for all nonzero a, 
and note that d(a) = d(ab) for all nonzero a and b. Also, we may let g = ba! to 
have b = aq, so that r = 0 is always possible. Ml 


Divisibility in a field is trivial since nonzero elements are all multiples of each 
other. Exercise 9.11.7 will reveal that any valuation on a field must be constant. 


Example 9.11.5 The Gaussian integers Z[i] are Euclidean, and the function 
defined by d(a + bi) = a’ + b? can be shown to be a valuation that works. We will 
not provide a proofhere. 


Before we spend some quality time with one more very important exam- 
ple of a Euclidean domain, let’s derive some results about Euclidean domains 
in general. First, in the next exercise you will show that a Euclidean domain 
is a PID. If D is Euclidean and / is an ideal, then you must find some ideal 
element of which every element of J is a multiple. But that should not be too 
hard, for an element whose valuation is minimal will serve nicely as a genera- 
tor, and the division algorithm on D* ought to be just the tool to enable you to 
show it. 


EXERCISE 9.11.6 A Euclidean domain is a PID. 


If D is Euclidean, it is by definition an integral domain. Thus it has a unity 
element e, and for any a € D, it follows that d(e) < d(ea) = d(a), so that d(e) is 
minimal among all values of d. This fact and the division algorithm should come 
in handy when you prove the = direction of the following. 


EXERCISE 9.11.7. Ina Euclidean domain with valuation d, d(u) = d(e) if and 
only if u is a unit. 
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Exercise 9.11.7 sheds a little more light on some things we already know. First, 
since absolute value can serve as a valuation on the integers, we see that the 
units in the integers are precisely the values of x for which |x| = |1|, namely, +1. 
Also, since every nonzero element of a field is a unit, any valuation d must satisfy 
d(x) = d(e) for all nonzero x. Thus d must be constant. Conversely, if a valuation 
on a Euclidean domain is constant, then every nonzero element is a unit, so that 
the Euclidean domain is a field. 

As we have progressed from rings to domains to UFDs to PIDs to Euclidean 
domains, we have claimed that there exist examples of one structure that do not 
qualify as an example of the next most restrictive structure. In every case up to 
now, we have provided an example complete with proof, except for Z[t]. We have 
shown that Z[t] is not a PID (Corollary 9.10.7), but we have not yet shown it is a 
UFD. We will do this in the next section. 

About now you should be asking, “Where is my example of a PID that’s not 
Euclidean?” Well, this is the only place in our progression from more general to 
more specialized rings where we are going to present an example of a PID that is 
not Euclidean and will give only a loose explanation of how this would be shown. 
The classic example of a non-Euclidean PID was first constructed by T. Motzkin in 
1949—very recently indeed by mathematical standards. It is Z[(1 + /—19)/2], the 
set of all expressions of the form m + n(1 + /—19)/2, where m and n are integers. 
For convenience, let’s write a = (1 + /—19)/2, and discuss how one goes about 
showing Z[a] is a PID that is not Euclidean. 

First, we address the fact that Z[a] is not Euclidean. We would do this by 
showing that every Euclidean domain has a certain feature, then showing that 
Z[a] does not have this feature. If D is any Euclidean domain that is not a field, 
then there will exist nonzero, non-unit elements. If d is a valuation, then a nonzero, 
non-unit element x will satisfy d(x) > d(e). Among all elements of D, let a be a 
nonzero, non-unit element for which d(a) is minimal. Such an element is called 
a universal side divisor. By the division algorithm on D, any nonzero x can be 
written as x = aq+r, where either r= 0 or d(r) < d(a). Since d(a) is minimal 
among all nonzero, non-unit elements of D, we may say that r is either zero or a 
unit. Every Euclidean domain that is not a field will have universal side divisors 
because the valuation will not be constant. 

Next we need to know what the units are in Z[a]. It turns out that the only 
units in Z[a] are +1. So let’s suppose Z[a] is a Euclidean domain and see how 
this leads to a contradiction. Since the only units in Z[a] are +1, Z[a] is not a 
field. Thus there exists a universal side divisor a € Z[a], and every x € Z[a] can be 
written as x = aq+r, where r € {0, +1}. In particular, x = 2 must be writable in 
this way. So aq = 2 — r, and the only possible values of 2 — r are {1, 2, 3}. Thus a 
divides at least one of {1, 2, 3}, but since a is not a unit, a does not divide 1. With 
some work, we could show that the only divisors of 2 are {+1, +2} and the only 
divisors of 3 are { 3}. Thus a € {+2, +3}. But letting x = a, it turns out that 
there is no q € Z[a] for which a = aq +r, given that a € {42, +3} andr € {0, +1}. 
This is a contradiction, so Z[a] is not a Euclidean domain. 

Now we address the fact that Z[q] is a PID. First, let’s take the defining char- 
acteristic of a Euclidean domain and state it first in the original way and then in 
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an altered, but equivalent, form. A domain D is a Euclidean domain if there exists 
a valuation d on D™ such that for all nonzero a and b, 


1. There exist g,r € D such that b = aq +r and either r = 0 or d(r) < d(a). 


2. Either b is in the ideal generated by a, or it is possible to subtract from b some q 
multiple of a to produce an element r = b — aq whose valuation is smaller than 
that of a, that is, d(r) < d(a). 


Let’s weaken this second form a bit. Suppose D is a domain with a valuation 
such that for all nonzero a and b, either b is in the ideal generated by a, or there 
is some linear combination of a and b whose valuation is smaller than that of a. 
That is, either b € (a) or there exist nonzero domain elements m and n such that 
d(ma + nb) < d(a). This property is sort of like the division algorithm, but not 
quite as strong. In the event that b is not a multiple of a, we do not insist that 
some multiple of a can be subtracted from b to produce an element of small 
valuation, but only that some linear combination of a and b has sufficiently small 
valuation. A valuation with this feature is called a Dedekind-Hasse norm, and if a 
domain is such that there exists a Dedekind-Hasse norm, then you can show the 
following. 


EXERCISE 9.11.8 Suppose D is a domain with a valuation d and with the prop- 
erty that, for all nonzero a and b, either b € (a), or there exist domain elements m 
and n such that d(ma + nb) < d(a). Then D is a PID.!! 


The trick to showing that Z[a] is a PID is to show that Z[a] entertains a 
Dedekind-Hasse norm. A valuation that works is d(a + ba) = a? + ab + 5b’, for 
which we will not provide any details. 


Polynomials over a Field 


An important example of a Euclidean domain is Q[t], and we want to address 
this now. Every nonzero polynomial has a nonnegative degree, which is precisely 
the valuation we will use. Since deg( fg) = deg f + deg g, and since deg g > 0, we 
have that deg f < deg( fg). Showing QJ[t] is Euclidean then boils down to proving 
a sort of division algorithm. So here is the theorem we need, with uniqueness of 
q and r to boot. The technique we use to get the proof off the ground should look 
surprisingly familiar. You will provide a few of the details in an exercise to follow. 


Theorem 9.12.1 Suppose f and g are nonzero polynomials in Q[r]. Then 
there exist unique polynomials g and r such that g = fq +r and either r = 0 or 
degr < deg f. 


Show that an arbitrary ideal / is principal by letting a € J be such that d(a) is minimum. You can then 
show that any b € J must be a multiple of a. 
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Proof. Let f and g be nonzero polynomials, and define 
S={g— fa:q€ QI} (9.57) 


If the zero polynomial is in S, then we have g = fq. Otherwise we may let r be any 
polynomial in S whose degree is minimal. Then r = g — fq for some polynomial 
q,so that g = fq+r.Toshow degr < deg f, suppose degr > deg f. Then we may 
write 


f =amt”™ +-+-a9 and r=Dyt"+---bo (9.58) 
where m <n, and neither a, nor b, is zero. By Exercise 9.12.2, we may create a 


nonzero element of S whose degree is strictly less than the degree of r, which is a 
contradiction. Also by Exercise 9.12.2, g and r are unique. 


EXERCISE 9.12.2 Finish the proof of Theorem 9.12.1 by showing the fol- 
lowing. 


(a) The polynomial rj = g — fq — (bn/am)t"-" f is an element of S such that 
degr; < degr. 


(b) If g = fai +r; and g = fq2 +12 where r; = 0 or degr; < deg f and where 
ra = 0 or degra < deg f, then qi = q2 and ry = 12.7 


Having shown that Q[t] is Euclidean opens a floodgate of interesting facts 
about this very important ring. First, unlike Z[t], Q[t] is a PID, so every nontriv- 
ial ideal has a nonzero generator, and its degree is minimal among all elements 
of the ideal. Furthermore, from our comments before the statement of Exer- 
cise 9.11.6, any polynomial in the ideal whose degree is minimal will generate 
the ideal. If we write f = ajt" + Gpit”— | +++. + ag, then a, 4 0 and we can cre- 
ate an associate polynomial m = a =1?+ (aga taee + --++a0/dyn whose 
leading coefficient is one. Since m and f have the same degree, they generate 
the same ideal. A polynomial whose leading coefficient is the unity element 
of the ring of coefficients is called monic, and we have therefore argued the 
following. 


Theorem 9.12.3 Every ideal in Q[t] has a monic generator. 


Since Q[t] is Euclidean, it is also a UFD, so that every nonzero polynomial 
factors into irreducible polynomials. Furthermore, this factorization is unique, at 
least to a point. Any two factorizations that appear to be different can be seen 
upon inspection to have components that pair up as associates. Now associates 
are unit multiples of each other, and from Exercise 9.8.24(b), the units in Q[t] are 


12 What can you say about deg(r) — r,)? What does Exercise 9.8.22 allow you to conclude? 
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the polynomials of degree zero. Thus two polynomials are associates if and only 
if one is a nonzero constant multiple of the other. 


Example 9.12.4 ‘Two factorizations of P43 +42 are t(t+2)(t+1) and 
(41)(3t + 4)(3t + 3). Associate pairs are t and 4t, f+ 2 and zt + i, and ¢+1 and 
Bee 3 

st+7. 

qi g 


We have said a lot about irreducible polynomials in Q[f], but we have not de- 
veloped criteria by which we can determine whether a polynomial is reducible 
or irreducible. By Exercise 9.8.22, if deg f =1 and f = gh, then either g or / has 
degree zero and is therefore a unit. Thus polynomials of degree one are irreducible. 

Another simple criterion for irreducibility involves evaluating the polynomial 
at some rational number. In Section 9.4, we said we were not really interested in 
polynomials as functions where we would plug in values for t, but more as a string 
of symbols. Well, that statement is now false. As a ring in and of itself, a polynomial 
ring is exactly as we described it in Section 9.4, and ¢ is indeed just a formal (and 
foreign) symbol whose presence is used to define the addition and multiplication 
of two elements. But for a given polynomial f there are some pretty good reasons 
we might want to choose a specific ring element w and calculate the value of f(a). 
The ring element f(w) can make an important statement about the polynomial 
f and its role in the polynomial ring. 

You spent time in high school algebra trying to factor polynomials, and one 
way you might have stumbled onto a linear factor of f was by discovering some 
number a such that f(a) = 0. For example, if f = t? — 1? + 2t — 2, you might have 
noticed that f(1) = 0 by adding the coefficients, so you could write f = (t — 1)g, 
which by division became f = (t — 1)(t2 + 2). The reason this worked was the 
following exercise. 


EXERCISE 9.12.5 Suppose f is a polynomial in Q[¢] and a is a rational number. 
Then (t — a) | f if and only if f(@ = 0. 


Exercise 9.12.5 makes the following almost immediate. 


Theorem 9.12.6 Suppose f is a polynomial in Q[t] whose degree is either 
2 or 3. Then f is reducible if and only if there exists a rational number a such 
that f(a) = 0. 


Proof. Let f be a polynomial in Q[t] of degree 2 or 3. 


(=) Suppose f is reducible. Then f = gh, where the degrees of g and h are 
nonzero, but strictly less than deg f. Without loss of generality, we may 
assume deg g = 1, so that g = at+ b for rational numbers a and b, where 
ais not zero. Thus f(—b/a) = g(—b/a)h(—b/a) = 0. 


(<) Suppose there exists rational a such that f(a) =0. By Exercise 9.12.5, 
(t—a) | f,so that f is reducible. 
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Whether a polynomial with rational coefficients is reducible is not an easy 
question to answer in general. Several criteria can help answer the question for 
certain special polynomials, and you will see them in your upper level algebra 
class. 

If there is one important idea that you should have clued into by this point in 
the mathematical game, it is that certain properties of mathematical structures are 
the logical basis for other properties and that these properties can sometimes be 
isolated and translated over to other structures that might be very different. For 
example, we proved in Theorem 2.1.11 that a- 0 = 0 for all integers a. Then we 
noted in Section 9.1 that a. 0 = 0-a = 0 in any ring by an argument identical to 
that for the integers. A few basic ring properties come together to make this true 
(basic properties of addition, including additive cancellation, and the distributive 
property), even without commutativity of multiplication. So any mathematical 
structure in which we have additive cancellation and the distributive property will 
be a structure where a-0=0-a=0. 

The fact that Q[t] is a Euclidean domain calls on the fact that the ratio- 
nal numbers are a field, but it does not require that they be any particular 
kind of field. If we replace the rationals with an arbitrary field K in Theorems 
9.12.1-9.12.6, exactly the same proofs work. This can be particularly interesting 
if we use the field Z, for prime number p. Here are restatements of Theorems 
9.12.1-9.12.6 for an arbitrary field and some examples to illustrate their broader 
application. 


Theorem 9.12.7 Let K bea field. Then K[f] with valuation deg f is a Euclidean 
domain. 


EXERCISE 9.12.8 


(a) Let fp =307 +142 and gy =344+3°+4242t+4 be polynomials in 
Zs5[t]. Show that there exist g and r in Zs[t] such that g; = fig+r and 
degr < deg f;.° 


(b) Let fo = 2r+2 and gy =? be polynomials in Z¢[t]. Show that it is impos- 
sible to write g2 = foqg+r for any g and r in Ze¢[t] where either r= 0 or 
degr < deg fo. 


Theorem 9.12.9 Let K be a field, f € K[t] anda eé K. Then (t — a) | f if and 
only if f(a) = 0. 


Theorem 9.12.10 Let K be a field, f € K[t] and suppose the degree of f is 
either 2 or 3. Then f is reducible if and only if there exists a € K such that 
f(a) = 0. 


13 Use polynomial division. 
14 The coefficient of 72 in fq must be one. Show this is impossible. 
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EXERCISE 9.12.11 Apply Theorem 9.12.10 to the following polynomials 
in Za[t] by either finding a proper factorization or explaining why they are 
irreducible. 


(a) ff=ertttl 
(b) fP=r+r+2 
(c) RP=P4+r42 
(d) fa=Pt+rt2 


Polynomials over the Integers 


We have waited to show that Z[t] is a UFD, and now is the time to tackle the 
question. We could have shown the existence of a factorization of a polyno- 
mial in Z[t] into irreducibles earlier, but uniqueness of this factorization up to 
order and association of the factors requires us to view elements of Z[t] as ele- 
ments of Q[t], where factorizations are unique up to association. The reason 
we have some work to do to show uniqueness is that reducibility and associa- 
tion in Q[t] are different from reducibility and association in Z[r]. Polynomials 
such as 2t+ 6 and 10¢ + 15 are irreducible and associates in Q[t], but not in Z[r], 
because the constant polynomials 2 and 5 are units in Q[f] but not in Z[r]. First the 
easy part. 


EXERCISE 9.13.1 Every nonzero, non-unit polynomial in Z[t] has a factoriza- 
tion into irreducible polynomials in Z[¢].° 


To make our way to uniqueness up to order and association, we need the 
following lemma. You will provide the climactic detail as an exercise. Remember 
that the term primitive applies only to polynomials of degree at least one. 


Theorem 9.13.2 (Gauss’s Lemma). In Z/[t], the product of two primitive 
polynomials is primitive. 


Proof. Suppose f and g are primitive polynomials in Z[t]. We show that fg is 
primitive by supposing p is any prime number, and then showing there is some 
coefficient in fg that is not divisible by p. 

Suppose p is a prime number, and write 


f =ant"™ +---+a9 and g=bdyt"+---+b9 (9.59) 


Use strong induction on the degree of f and mimic the proof of Theorem 3.5.19. Exercise 9.10.1 takes 
care of the case deg(f) = 0. 
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Since f and g are primitive, there are coefficients in both f and g that are 
not divisible by p. Let a; and b; be, respectively, the coefficients of the low- 
est powers of t in f and g that are not divisible by p; that is, p divides all 
Of ao, a1, ...,4j-1, 50, b1,..., bk-1, but not aj or by. Then by Exercise 9.13.3, 
p does not divide the coefficient of t/+* in fg. Since this is true for all prime 
numbers, fg is primitive. 


EXERCISE 9.13.3 Finish the proof of Theorem 9.13.2 by showing p does not 
divide the coefficient of t/+* in fg.!® 


The next theorem says simply that if a polynomial has integer coefficients, 
and it can be factored into polynomials of degree at least one by viewing it as 
an element of Q[t] and resorting to rational coefficients in the factors, then you 
can adjust these factor coefficients and make them integers. We will prove this 
in a somewhat conversational way to keep the notation from getting too sloppy. 
Notice the point at which we apply Exercise 9.13.3. 


Theorem 9.13.4 Suppose f is a polynomial with integer coefficients. If f is 
reducible in QJt], then it is reducible in Z[t]. 


Proof. Suppose f has integer coefficients, and suppose f = fi f2, where f; and 
f2 are polynomials with rational coefficients, and both f; and fo have degree at 
least 1. Let d, and dz be, respectively, the product of all the denominators of all the 
coefficients of f; and f2; then let g, = df; and go = dy fz, so that g; and gz have 
integer coefficients. Next, let cy and cz be the content of g; and gz, respectively, and 
factor these out to write gi = cif, and g2 = czh2, so that h1 and hz are primitive. 
Also, if we let c be the content of f, we may write f = cg, where g is primitive. 
Thus we have 


(cddz)g = (didz) f = di fidz fz = 8182 = (cica)hih2 (9.60) 


where g,/41, and hz are all primitive polynomials. By Exercise 9.13.3, hih2 is 
also primitive, so the content of the left-hand side of Eq. (9.60) is cd,d, and 
the content of the right-hand side is cyc2. Since cd,d) and c,c2 are both positive 
integers, cd,d) = cyc2, and g = hyho. Thus f = cg = chyhy, and we have a proper 
factorization of f with integer coefficients. 


Now the result we have all been waiting for. 


Theorem 9.13.5 The factorization in Theorem 9.13.4 is unique up to order and 
association of the factors. 


The coefficient of t/t* is yo ajbj4x-i. Look separately at the terms 0<i< j—1, i= j, and 
jti<i<jtk. 
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Proof. Suppose f € Z[t] can be written as 


f= Pipe es Pa = NG2 i Gh (9.61) 


where all px and gx are irreducible polynomials with integer coefficients. Ifn = 1, 
then m = 1 and p; = q1, so that the result is trivially true. 

So suppose n > 2 and that the result is true for all polynomials of degree 
less than n. Note that all pz and q, are either prime constant polynomials or 
primitive. Since pm | qi... dn, then viewing p» and all g, as elements of Q[r], 
irreducible in Q[t] by Theorem 9.13.4, we have that pm | qx (in Q[t]) for some k 
by Theorem 9.10.12. Reordering the g,, we may assume pm | gn, and since qy 1s 
irreducible in Q[t], we may write (a/b) pm = qn for some rational number a/b. 
Thus ap» = bqn, and since p,, and q, are primitive, a = +b. Therefore g, = 4pm, 
so that p,, and qg, are associates in Z[t]. Substituting +p,, for g, in Eq. (9.61) and 
canceling pm, we have 


P1---Pm—-1 = £41 ---n-1 (9.62) 


By the inductive assumption m = n, and the remaining p; and gq; may be reordered 
and paired as associates, so that the factorization of f into irreducible poly- 
nomials with integer coefficients is unique up to order and association of the 
factors. 


Ring Morphisms 


The theory of ring morphisms should seem a breezy topic after having studied 
group morphisms in Section 8.6. The only difference in the definition in the context 
of rings is that there are two binary operations to preserve. As we are doing in- 
creasingly often, we will be fairly relaxed about notation for elements and 
operations in the two rings, unless we need it to avoid confusion. 


Definition 9.14.1 Suppose R and S are rings and ¢: R — S is a function with 
the properties that p(x + y) = ¢(x) + o(y) and ¢(xy) = $(x)G(y) for all x, y € R. 
Then ¢is called a homomorphism (or morphism) from R to S. The terms monomor- 
phism, epimorphism, isomorphism, and automorphism are defined in a way 
analogous to that for groups, and if there exists an isomorphism from R to S, 
we write R= S. 


Here are some examples. Let’s start with a trivial one. 


Example 9.14.2 If R and S are rings, the mapping defined by ¢(x) = 0 for all 
x € Ris called the trivial morphism. 


Example 9.14.3 The identity mapping is an automorphism of any ring. Ml 
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Example 9.14.4 Define ¢:Z— Z, by (x) = (n) +x, the equivalence class 
of x mod n. If we think of Z, as the set of integers {0,1,..., — 1} with clock 
arithmetic, @ maps x to its remainder upon division by n, and ¢ is an epi- 
morphism. For 


oe ty) =(n) ++ yl =[@) + x] +1) + y] = 6) + 60) (9.63) 
o(xy) = (n) + xy = [() + x]l@) + y] = 6460) (9.64) 


Furthermore, ¢ is onto. For if (n) + k is any coset in Z,, then d(k) =(n) +k. Hf 


Example 9.14.5 For Z[4/x] in the form of Eq. (9.33), define ¢ : Z > Z[X/x] by 


bk) = k+ OYE +++ +00 x71 (9.65) 
Then ¢isamonomorphism. 


Example 9.14.5 illustrates a subtle distinction between two ideas. After we 
constructed Z[</2] in Example 9.4.1, we said (page 303) that the integers are 
a subring of Z[\/2]. If we had been a bit more rigorous in our construction of 
Z{/2], we would have built it up, not as a set of expressions of the form a+ 
bs/2 + cs/4, but as a completely formal set of ordered triples 


Z[V2] = {(a, b,c): a,b,c € Z} (9.66) 


where addition and multiplication are defined to coincide with the definitions in 
Example 9.4.1. Specifically, defining addition and multiplication by 


(a,b,c) + (d,e, f) = (a+d,b+e,c+ f) 


.67 
(a, b, c)+(d, e, f) = (ad + 2bf + 2ce, ae+ bd + 2cf af + be + cd) we 
incorporates the behavior of /2 into the operations, even though they are mere 
manipulations of ordered triples with no apparent presence of </2. With these 
definitions, to say that the integers are a subring of Z[</2] is technically not true, 
for Z is not a subset of Z[</2] as subrings must be. Whereas the integers are a set 
of numbers, Z[/2] is a set of ordered triples of numbers. However, it does not 
mean that the link between them is somehow illusory. Defining y : Z > Z[./2] by 
y(n) = (n, 0, 0), we have an exact parallel to ¢ in Eq. (9.65). Whether you think of 
Z[s/2] as we originally defined it in Example 9.4.1 or as in Eqs. (9.66) and (9.67), 
we say that the integers have been embedded monomorphically in Z[/2]. Imagine 
the integers and Z{¥/2] as separate, where Z is a set of elements of the form n and 
Z[/2] consists of elements of the form (a, b,c). Then the range of @ is the set of 
all elements of Z[./2] of the form (a, 0, 0) and is isomorphic to the integers, and 
hence structurally the same. 
Example 9.14.5 illustrates a slight breach of rigor we committed in defining 
ring extensions in Section 9.4, so let’s clear that up. It illustrates a technicality 
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when we say that Z[/2] is an extension of the integers. If R and S are rings where 
R is a subset of S, then saying S is an extension of R has the same meaning as 
saying R is a subring of S. But if you start with R and want to extend it to some $ 
by adjoining an element or elements, the standard, more rigorous way is to build 
S from scratch and then monomorphically embed R in S. Then when we say that 
S is an extension of R, we mean that S is the range of a monomorphism whose 
domain is R. 


EXERCISE 9.14.6 Let R be a commutative ring, and fix some ring element a. 
Define ¢q : R[t] > R by ¢o(f) = f(a). Clearly, dq is a function (quick mental 
exercise). Show that ¢ is an epimorphism. This particularly important morphism 
is called the evaluation at a morphism. It maps every polynomial in R[f] to its 
value at a. 


EXERCISE 9.14.7. Let R be a commutative ring with unity. Describe the 
evaluation morphisms ¢o and ¢¢. 


EXERCISE 9.14.8 On the Gaussian integers, define ¢(a + bi) = a — bi. Show 
that ¢ is an automorphism. This is called the conjugation morphism, for it sends a 
Gaussian integer to its complex conjugate. 


EXERCISE 9.14.9 Let R be a ring with unity e, and define ¢:Z— R by 
o(n) = ne. Show that ¢ is a morphism. 


EXERCISE 9.14.10 Define ¢:Zoy2 > Z by o({m al) =a. Is da 


morphism? Prove or disprove. 


EXERCISE 9.14.11 Define 6: Z— Zox2 by ¢(n) =|" + |. Is ¢. a morphism? 
y 0 ry 


Prove or disprove. 


Properties of Ring Morphisms 


Since a ring morphism preserves addition between the two rings as abelian addi- 
tive groups, our results from Section 8.6 apply to addition. Thus we gain the 
following for free from Theorem 8.6.8 and Exercise 8.6.10. 


Theorem 9.14.12 Suppose ¢: R > Sis aring morphism. Then 
1. 6(0) = 0. 

2. o(—r) = —(r) for allr € R. 

3. o(nr) = ng(r) for all r € R and all integers n. 


To have analogous results for multiplication, we have to make some modifications. 
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EXERCISE 9.14.13 If R is a ring with unity er and if 6: R—> S is a ring 
morphism such that (er) = 0, then ¢ is the trivial morphism. 


The contrapositive of Exercise 9.14.13 reveals that if @: R — S is a nontrivial 
morphism, then the unity of R does not map to the zero element of S. If S has 
unity es, then comparable to Theorem 8.6.8, you might be tempted to think that a 
nontrivial morphism ¢ : R > S would have to satisfy ¢(er) = es. To prove such a 
property and the exponent rules related to it, multiplicative cancellation is neces- 
sary in S. Then a proof of the next result becomes possible. Since the proof of the 
exponent rules in parts 2 and 4 would be identical to the proof of Exercise 8.6.10, 
you only need to prove parts 1 and 3. 


Theorem 9.14.14 Suppose R is a ring with unity er, S is an integral domain 
with unity es,and @: R > S is a nontrivial ring morphism. 


1. d(er) = es. 

2. For allr € R and nonnegative integers n, d(r") = [¢(r)]". 
3. Ifu € Risa unit, then so is o(w), and ¢(u—') = [d(w)]}-1. 
4 


. Ifu € Risa unit, then d(u~") = [b(u)]~” for all positive integers n. 
EXERCISE 9.14.15 Prove parts (1) and (3) of Theorem 9.14.14. 


Theorems 9.14.12 and 9.14.14 reveal that the only nontrivial morphism from 
the integers to any other integral domain is defined by ¢(n) = ne. For suppose 
w:Z— Disa nontrivial morphism and e is the unity in D. Since 1 is the unity 
in the integers, it must be that w(1) = e. Furthermore, for any integer n, w(n) = 
w(n- 1) =nw(1) = ne. Thus the only nontrivial morphism from the integers to the 
integers is the identity. You can use other parts of Theorems 9.14.12 and 9.14.14 
to show the following. 


EXERCISE 9.14.16 The only automorphism of the rationals is the identity. 


If S is not a domain, Theorem 9.14.14 might not apply. For example, if ¢ is the 


morphism from Exercise 9.14.11, @(1) = which is not the identity matrix. 


0 OV 
Furthermore, though 1 is a unit in the integers, ¢(1) is not a unit in Z2x2. 

A lot of our work with group morphisms involved their relationship to normal 
subgroups. The interesting relationships between ring morphisms and substruc- 
tures involve ideals. The next theorem is a parallel to Exercise 8.6.14. When you 
prove the left-sided case as an exercise, parts of your proof will follow imme- 
diately from your work in group theory and not require new arguments. For 
example, in your proof of part (1), you may simply point out that since R is an 
additive group with respect to addition, @(R) is an additive subgroup of S by 
Exercise 8.6.14. That takes care of three of the requirements in showing that 
¢(R) is a subring of S. 
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Theorem 9.14.17 Suppose ¢: R > Sis a ring morphism. Then 


1. @(R) is a subring of S. 
2. If J is a left (right) ideal of R, then ¢(J/) is a left (right) ideal of @(R). 
3. If J isa left (right) ideal of S, then @~!(J) is a left (right) ideal of R. 


EXERCISE 9.14.18 Prove the left-sided cases of Theorem 9.14.17. 


Similar to groups, the kernel of a ring morphism is defined as 
Ker(@) = {re R: d(r) = 0} (9.68) 


Right away, we have the following. Save yourself some work by applying 
Exercise 8.6.16. 


EXERCISE 9.14.19 If g: R— Sisaring morphism, then Ker(@) is a two-sided 
ideal of R. 


The following needs no additional proof, for it follows from Exercise 8.6.17 
applied to R as an additive group. 


Theorem 9.14.20 If ¢: R— S is a ring morphism, then Ker(¢) = {0} if and 
only if ¢ is one-to-one. 


If R is any ring with unity element e, the morphism from the integers defined 
by @(n) = ne can look either of two ways, depending on the characteristic of R. If 
R has characteristic zero, then by definition ne is never zero for nonzero integer 
n. The following should therefore be quick. 


EXERCISE 9.14.21 If R is a ring with unity e and characteristic zero, then 
g y 
o¢:Z— R defined by $(n) = ne is one-to-one. 


Exercises 9.14.9 and 9.14.21 say that the integers can be monomorphically 
embedded in a ring R with unity and characteristic zero. We loosely say that 
R contains the integers, meaning it contains a subring generated by its unity that 
is isomorphic to the integers. For example, in R2x2, the image of the integers 
under ¢ is 


(IE BIBI I] 0% 


If R has nonzero characteristic, then there is a smallest positive integer n for 
which ne = 0. It might seem pretty clear that R in this case will contain a subring 
that looks like Z, instead of the integers. Before we can make an argument for 
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this, however, we need to create the notion of a quotient ring, which we will do in 
the next section. 


Quotient Rings 


With a group and a normal subgroup we can build a quotient group. In an analo- 
gous way, given a ring and an ideal, we can build a quotient ring. Part of the work 
in building a quotient ring is exactly like building a quotient group, so the fact 
that a quotient ring has properties R1—-R10 has already been done in part. As with 
groups, building the quotient structure begins by defining a form of equivalence. 


Theorem 9.15.1 Let R be aring and / an ideal of R. For ring elements a and b, 
define a =, bifa—b € I. Then =; is an equivalence relation on R. 


Since R is an abelian group with respect to its addition operation, J is a normal 
additive subgroup of R. Therefore, Theorem 9.15.1 is merely a restatement of 
Exercise 8.3.2 in its additive form and needs no additional proof. So we are ready 
to define the set R/J with its addition and multiplication operations, and show 
that it has all properties R1-R10. There are no surprises in the definitions of the 
binary operations on R//. But considering the need for H to be anormal subgroup 
of G in showing the binary operation on G/H is well defined, little bells should 
be going off in your head as you consider the burden of showing that addition 
and multiplication on R/J are well defined. It should be no surprise that you 
will exploit the fact that / is an ideal of R in at least part of this demonstration. 
What you might not see at first is what kind of ideal J needs to be and where you 
will call on the fact that J is this kind of ideal. Since addition in R is commutative, 
Zisnormal as an additive subgroup of R. Thus addition is well defined by our work 
in Chapter 8. It’s a different story for multiplication, though. If multiplication is 
not commutative, then a left ideal might not be a right ideal, and vice versa. So 
we might wonder whether J needs merely to be either a left ideal or a right ideal, 
or perhaps both. It turns out that J needs to be a two-sided ideal, and you will see 
why when you provide the missing piece of the proof of the following. 


Theorem 9.15.2 Let R be a ring and / a two-sided ideal of R. For a € R write 
I+ a= [a], where [a] is the equivalence class of a modulo J. Define addition @ 
and multiplication ® on R/I = {I +a:ae R} by 


U+a6U+b)=14+ (a+b) and (J+a)@(1+b)=I1+4+ (ab) (9.70) 
Then R// is a ring under the operations © and ®. 


We can talk our way through almost all properties R1-R10, so that your 
work in proving Theorem 9.15.2 will be minimal. Since R is an abelian additive 
group and / is anormal additive subgroup, R//J has properties R1—R6. Properties 
R8-R10 are immediate from the definitions of © and ®. The only property that 
takes any real work is R7, showing that © is well defined, and this is what you 
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will show in the next exercise. True to form, you suppose /+a=1+b and 
I+c=I1+d and use this to show that J + ac = I + bd. That is, supposing a — b 
and c — d are in J should somehow allow you to show that ac — bd is in J also. 


EXERCISE 9.15.3 Finish the proof of Theorem 9.15.2 by showing that multipli- 
cation is well defined. 


With R/J defined and shown to be a ring, we should be able to bypass all 
the chatty exposition as in Chapter 8 and jump right to results analogous to 
Theorems 8.6.19 and 8.6.21. 


EXERCISE 9.15.4 


(a) State a result analogous to Theorem 8.6.19 for a ring and the quotient ring 
created by “modding” out an ideal. 


(b) Explain how all except one component of the proof of this theorem follows 
from Theorem 8.6.19. 


(c) Provide the missing component to complete the proof of your theorem. 


EXERCISE 9.15.5 Suppose R and S are rings and ¢@ : R > Sis an epimorphism. 
Then S = R/ Ker(¢). 


Let’s return to the morphism in Exercise 9.14.9 defined by ¢(n) = ne. Ifa ring 
has nonzero characteristic, then there is a smallest positive integer n for which 
ne = 0. By Exercise 9.14.19, Ker(@) is an ideal in the integers, which is a PID. 
Thus Ker(¢) = (k) for some integer k. Since $(k) = 0, and since n is the smallest 
positive integer for which ne = 0, it must be that k > n. But since n € Ker(@), it 
must be that n is a multiple of k, so that k < n. Thus k = n. By Exercise 9.15.5, the 
range of ¢ is isomorphic to Z/(n). That is, R contains a subring isomorphic to Zp. 

If we think of a ring with unity e as an abelian additive group and let S be the 
subgroup generated by e, the identity element of the other operation, then the 
form of S can be determined by looking at the additive form of Eq. (8.22). 


S={ne:neZ} (9.71) 


which is precisely the range of ¢. Thus as far as addition is concerned, S is the 
smallest additive subgroup of R that contains e. If we could show that S is closed 
under multiplication, we would have that S is the smallest subring of R that con- 
tains e. But closure is immediate by Exercise 9.3.20. In bits and pieces, we have 
proved the following. 


Theorem 9.15.6 Suppose R is a ring with unity. If R has characteristic zero, 
then it contains a subring isomorphic to the integers. If R has positive characteris- 
ticn, then it contains a subring isomorphic to Z,,. In either case, such is the smallest 
subring of R that contains e. 
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Because the polynomial ring over a field is Euclidean, it makes for some partic- 
ularly important quotient rings. Let’s spend some time studying the quotient ring 
created by “modding” out the ideal generated by f = 2° — t +5 from Q[r]. The 
ties back to the integers and the integers modulo n are uncanny. To be concrete, 
we will let n = 6 and draw parallels between the relationship of the integers to the 
integers modulo 6 and the relationship of Q[f] to Q[t]/(/f). 

Since the integers are a Euclidean domain, every integer can be written as 
6q +r for integers g and r and where 0 < r < 5. Consequently, every integer is 
equivalent mod 6 to some element of {0, 1, 2, 3, 4, 5},so that every element of Ze is 
acoset that can be addressed by a unique representative element in {0, 1, 2, 3, 4, 5}. 
To perform addition and multiplication in Ze, a purist would write something like 


(6) +4] +[(66) +3] = 6) +7= (6) +1 (9.72) 


or 
[(6) + 5] x [(6) +4] = (6) + 20 = (6) +2 (9.73) 


where the first step in these two calculations is an application of the definitions of 
addition and multiplication in Ze from Theorem 9.15.2 and the second step is an 
application of equivalence mod 6 to simplify the calculation to a standard form 
with representative element from {0, 1, 2,3, 4,5}. As long as we realize that this 
is what we’re doing, we can write these calculations as 


44+3=67=61 and 5x4=6 20=¢2 (9.74) 


This form has the imagery of performing addition and multiplication in the integers, 
with the stipulation that any sum or product that gets kicked out of bounds, that is, 
out of {0, 1, 2, 3, 4, 5}, is hauled back into {0, 1, 2, 3, 4, 5} by subtracting a multiple 
of 6. All this helps us see how we may view elements of Zs and how they combine 
by addition and multiplication to produce other elements of Ze¢. 

For our specific polynomial f = 2° —t+5, the way to visualize elements 
of Q{t]/(f) in terms of polynomials in Q[f] is strikingly similar. Since QJ[f] is a 
Euclidean domain, any polynomial can be written uniquely as fq + r for polynomi- 
als g andr, where either r = 0 or degr < deg f. Soif we consider any coset (f) + g 
in the quotient ring, there is a unique polynomial r such that (f) +r = (f)+8 
and either r= 0 or degr < deg f. Thus elements of Q[r]/(f) may always be 
addressed by a representative polynomial r where either r = 0 or degr < deg f. 
Since we are using f = 21° — t+ 5, then we may write 


QU/(f) = {(f) +t? + bt+c:a,b,ceQ (9.75) 


and know that every element of the quotient ring can be written as some coset 
(f) + at? + bt +c. Furthermore, if (f) +71 = (f) +1, then ro =f) r1, so that 
r2 —1r1 is a multiple of f. Now nonzero multiples of f cannot have a degree 
less than the degree of f, but it is impossible that deg(r2 — 11) > deg f. Thus 
r| = rz, and we have that different polynomials of the form at” + bt + c will always 
generate different cosets of (f). 
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In the same way that we view elements of Z simply as {0, 1, 2,3, 4, 5}, we can 
view elements of Q[r]/(f) simply as polynomials of the form at* + bt +c. But we 
must consider how they add and multiply. Instead of using coset notation and 
writing [(f) + at? + byt + c1] + [(f) + ant? + bot + c2], we can just write 


[ayt? + byt + cr] + [ant + bot +. c2] =(p) (a1 +ap)t? + (by + ba)t + (c1 +2) 
(9.76) 


Adding two such polynomials cannot produce a sum of any larger degree, so 
Eq. (9.76) is all that needs to be said about addition in the quotient ring. However, 
for multiplication, let’s illustrate with a concrete example, where the details of 
polynomial division have been omitted. 


(47 + 2)Gf — 2t +8) =f) 1214 — 8F + 387? — 44 + 16 
=f) (6t — 4) f + 441? — 381 + 36 (9.77) 
=(f) 4472 — 38r+ 36 
If we multiply two elements of the quotient ring as if they were polynomials 
in Q[t], and we produce a product of degree at least three, we can apply the 
division algorithm to subtract an appropriate multiple of f from the product to 


produce an equivalent polynomial of the form at? + bt + c. Now it’s time for you 
to practice this procedure. 


EXERCISE 9.15.7. In Q[r], let f = 14+ 2r+1. Construct the form of elements 
of Q[f]/(f), and illustrate addition and multiplication. 


Now for another very interesting example. Since Zs is a field, Zs3[t] is a 
Euclidean domain, and we can construct the quotient ring Zs[r]/(f) for f € Zs[t] 
in a similar way. Let’s use f =#+t+2, construct the quotient ring, and 
look at addition and multiplication. By exactly the same reasoning as before, 
Za[tl/(f) = {at? + bt +c: a,b,c € Z3}. Notice this is a finite set. Each of a, b, 
and c can take on values from {0, 1, 2}, so Zs3[t]/(f) has 27 elements. Adding 
elements of Z3[t]/(f) is easy: 


(207 +442) + (7? +2t+ 2) =p) 3° +3t+ 4 =p 1 (9.78) 


Doing multiplication would look like the following if we simplify the product by 
way of the division algorithm. 


(Qf? + t+ 2) + 2t +2) =p) 214 + 50° + 8 +6r-+4 
=(f) 24 + 2 + 21 +1 
=(f) (2t + 2)f 


=f) 9 


(9.79) 
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However, there is a slick way to simplify multiplication by making substi- 
tutions. In Zs[t]/(f), f =f) 0, or Pape? =() 0. This can also be written as 
P =f) —t — 2 =f) 2t + 1. This means that any r° produced in the process of mul- 
tiplication can be replaced with 2r + 1, thus bringing the degree of a product back 
down. 


(Qf? +t +2) + 2t +2) =p) 214 + 50° + 87 +64-+4 
=f) 24+ 2P +27 +1 
=f) (20° +2 +27 +1 
=f) (2)(2t+1)+22t+) +2741 (9.80) 
=p) 4° 4244442427 41 
=f) 6t7 + 6t +3 
=f) 9 


EXERCISE 9.15.8 In Z3{t], let f =t4++2r+1. Construct the form of ele- 
ments of Zs[t]/(f), and illustrate addition and multiplication using the fact that 
t= (ft + 2, 


EXERCISE 9.15.9 Every element of Q[r]/ (t2 — 2) can be written in the form 
at + b for rational a and b. Use a trick similar to that in Exercise 9.15.8 to simplify 
(at + b)(ct+d). 


EXERCISE 9.15.10 InExercise 9.4.8 you showed Q[V2] is a field. Calculate and 
simplify (b + aV2)(d + cV2), and compare to Exercise 9.15.9. 


The last theorems and examples in this section illustrate some very interesting 
implications of the results we have worked so hard to develop. The next two results 
would not likely jump out at you as obvious, but they are elegant and not difficult 
to prove. 


EXERCISE 9.15.11 Suppose R is a commutative ring and / is an ideal of R. 
Then R// is an integral domain if and only if J is a prime ideal. 


The next result could actually be stated in if-and-only-if form, but we only 
need one direction. If R is a commutative ring with unity e, then R/J is also 
a commutative ring with unity 7 + e. Also, if J is maximal, it is by definition a 
proper ideal of R,so that R/J has more than one element. Thus if we choose some 
I+aeR/I where 1+a41+0, then a is not an element of J. Exercise 9.6.17 
and the maximality of J are just what you need to prove the following. 


EXERCISE 9.15.12 Suppose R is a commutative ring with unity, and J is an ideal 
of R. If J is maximal, then R/J/ is a field. 
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Now let’s put all this together in a very elegant construction. If K isa field, then 
K[t] is a Euclidean domain, hence a PID. If f € K[t] is irreducible (prime), then 
(f) is a prime ideal by Exercise 9.10.16. By Exercise 9.10.6, (f) is also maximal. 


Therefore, K[t]/(f) is a field. We can use these facts to do the following. 


Example 9.15.13 Construct a field with nine elements. 


Solution Since ¢? + 1 has no roots in Z3[¢], it is irreducible. Thus Z3[1]/(1? + 1) = 
{at+b:a,b € Zs} isa field with nine elements. For notational simplicity, we write 
at + b = (a,b) and illustrate multiplication in the Table 9.81. Notice 7 =(241) 2 
and the manifestation of this in the table. Also, notice how the table reveals that 
every element has a multiplicative inverse. 


x | (0,0) | 0,1) | ©,2) | 4,0) | a.) | @.2) | 2,0) | 2,1) | 2.2) 
(0,0) | (0,0) | (0,0) | (0,0) | 0,0) | (0,0) | (0,0) | ,0) | ,0) | (0,0) 
(0,1) | (0,0) | (0,1) | ©,2) | d,0) | a.1) | 2) | 2,0) | 2,1 | 2,2) 
(0,2) | (0,0) | (0,2) | 1) | 2,0) | (2,2) | 2. | 4,0) | a.2) | ayy 
(1,0) | (0,0) | 4,0) | 2,0) | ©,2) | 4.27} 227} On|] ay} ay 
(1,1) | 0,0) | a. | 2,2) | d.2) | 2.0) | OD | 2, | 0,2) | G,0) 
(1,2) | 0,0) | 4.2) | 2.) | 2,2) | 0.1) | G.0) | a, | 2,0 | €,2) 
(2,0) | (0,0) | (2,0) | 4,0) | 0,1) | (2.1) | Ga. | 0,2) | @,2) | G2) 
(2,1) | (0,0) | (2,1) | 4,2) | a.) | 0.2) | 2.0) | 2,2) | d.0) | 0.1) 
(2,2) | (0,0) | (2,2) | G,) | 2,1) | G,0) | 0,2) | 4,2) | ©, | 2,0) 

(9.81) 

P| 


EXERCISE 9.15.14 Construct a field with eight elements, providing complete 
Cayley tables for addition and multiplication. 
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<,6 

<=, 51, 60,95 

>, 5-6, 51 

>=,51 

€ (epsilon), 167-168 
e-neighborhood, 172, 173, 177 
2, 111, 110, 72, 200 
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A1-A22 assumptions, 3-7 
Abel, Neils Henrik, 247 
Abelian groups, 247 
Absolute value, 53-56. See also Norm 
Accumulation points, 177. See also Cluster 
points 
Addition 
absolute value and, 55-56 
associative property, 4, 46 
binary operation of, 4 
cancellation of, 46-47 
circular, 260 
closure property, 4, 46 
commutative property, 4, 46 
integer, 103 
of matrices, 290 
multiplication’s link with, 5, 49 
properties of, 46-48 
of rational numbers, 104-105 
subtraction and, 5 
well-defined, 4, 46 
Additive form, 257 
Additive identity, 4, 5, 46, 47 
Additive inverses, 4, 46, 47 
Adjoining roots, of ring elements, 301-304 
Aleph (Hebrew letter), 140 
Algebra, 243 


fundamental theorem of, 232, 233 
Algebraic functions, 122 
Algebraic numbers, 304 
Algebraic properties, of real numbers, 
45-51 
Algebraic structures, 243-246. See also 
Groups; Rings 
Algorithms. See Division algorithm 
Alternating group, 271-273 
Analysis, 159, 165, 221, 234 
Ancient Greeks, 108, 109, 110, 169 
AND statements, 13-14, 33-34 
Antisymmetric property (O2,T3, W2), 
114,117 
Archimedean property, 169-172 
Argument, combinatorial, 152 
Arithmetic 
clock, 260 
fundamental theorem of, 89-90 
Assignments, of numbers, 95 
Associates, 316, 317, 318 
Associative property 
addition, 4, 46 
binary operations and, 244-245 
multiplication, 4, 49 
Assumptions. See specific assumptions 
Asymmetric property, 116 
Asymptotes, 220 
Auckland, 101 
Automorphisms, 280, 281, 334, 336, 337 
Axiom of choice, 118. See also specific axioms 
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Base 10 decimal representation, 7 
Base case, 81 
Beira, 93 
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Index 


Biection, 127 
Binary operations. See specific binary 
operations 
Binomial theorem, 157-161 
Bolzano-Weierstrass theorem, 200, 201 
Boundary, 175-176 
Bounded away from L, 171 
Bounded functions, 207-208 
Bounded sequences, 187-190 
Bounded sets, 166 
Bounds. See also Least upper bound axiom 
from above, 166, 207 
from below, 166, 207 
greatest lower, 168-169 


C 


C1-C3 properties (closure of sets), 178. 
See also Closure property 
Calculus, 165, 190 
Cancellation, 49, 51 
of addition, 46-47 
in domain, 315 
multiplicative, 49, 51, 298, 315, 337 
in rings, 298, 337 
Cardinality, 135, 137, 138 
of Cartesian products, 144-148 
of finite sets, 138 
of infinite sets, 140 
Cartesian plane, 112 
Cartesian products, 111-112, 144-151 
Cauchy sequences, 202-206, 203f, 216 
Cayley tables, 245, 246f, 248f, 249f, 253f, 
262f, 263f, 274f, 344f 
Characteristic zero, 301 
Choice, axiom of, 118 
Circles, 109 
extended real numbers and, 
221-222, 221f 
unit, 215, 216 
Circular addition, 260 
Cities example, 92-93, 97-98, 99-101 
Clock arithmetic, 260 
Closed and open set, 174 
Closed intervals, 167 
Closed sets, 172-175 
Closure property 
addition, 4, 46 
integers and, 103 
multiplication, 4, 49 
of sets, 178-180 
Cluster points, 176-178 
Codomain, 119, 123 


Collections, 71. See also Families, of sets 
Combinations, 151-157 
Combinatorial argument, 152 
Combinatorics, 144 
Commensurability, 110 
Commutative property 

addition, 4, 46 

binary operations and, 245 

multiplication, 4, 49 
Commutative rings, 288, 301 
Compact sets, uniform continuity and, 

239-240 

Compactness, 180-183 
Comparing real numbers, 5-7 
Complement, 3, 63 
Complement rule, 147 
Completeness, 166 

as axiom, 204 

LUB, NIP, and, 205-206 

of metric space, 204 
Complex analysis, 159, 221 
Complex conjugate, 336 
Complex numbers, 249 

as field, 293 
Composite integer, 89 
Composition, of functions, 131-133, 132f 
Conclusion, 19 
Conjugates, 277 

complex, 336 
Conjugation morphisms, 336 
Connectedness, of sets, 175 
Content, 318, 319 
Context, of sets, 3 
Continuity, 212, 224-240 

implications of, 231-235 

IVT and, 231-233 

limit v., 225 

one-sided, 230-231 

and open sets, 233-235 

at a point, 224-228 

on a set, 228-230 

uniform, 235-240 
Continuous functions, 224-240 
Continuum, real numbers as, 108 
Contradiction, proof by, 18, 42, 70, 80-81 
Contrapositive, 21, 41, 69 
Convergence 

of Gauchy sequences, 203-205 

of sequences, to infinity, 196-197 

of sequences, to real numbers, 190-196 
Converse, of if-then statement, 21 
Corollary, 50 
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Cosets, 263, 264, 265 
Lagrange’s theorem and, 267-268 
left, 267, 279, 283 
right, 264, 266, 267 
Cosine, 215, 216 
Countable, 140 
positive rationals, 141-142, 141f 
set, 142 
Countably infinite, 140 
integers, 141 
natural numbers, 140 
whole numbers, 140 
Counterexamples, 43, 97 
Counting techniques, 144-157 
Covers, 180, 181 
open, 180, 181, 182, 183, 239 
Cycle notation, 270 
Cyclic subgroups, 255-259 


D 


D1-D3 properties (discontinuities), 227. 
See also Discontinuities 
D1-D2 properties (gcd), 60-61, 320. 
See also Greatest common divisor 
Dallas, 100, 150 
Darts analogy, 199-200 
Decimal representations, base 10,7 
Declarative sentences, 11. See also 
Statements 
Decreasing 
functions, 208 
sequences, 186 
Dedekind, Richard, 72 
Dedekind cut, 72 
Dedekind-Hasse norm, 328 
Degree 
of polynomial, 305-306 
of polynomial function, 126 
Deleted €-neighborhood, 177 
Deleted neighborhoods, 177, 210, 213, 214, 
216, 217, 218, 221, 222, 224 
DeMorgan’s laws, 17, 33, 34, 69, 76-77 


Dictionary, Random House Unabridged, 94 


Difference, of sets, 64 
Dihedral group, 273-274 
Direct proofs, 40-41 
Directed graphs, 115 
Directed paths, 116 
Discontinuities, 225, 226, 227, 227f, 228 
D1-D3 properties, 227 
jump, 227 
removable, 227 
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Disease/medication/panacea metaphor, 
170-171, 235 
Disjoint sets, 67,71 
Disjoint union, 135n, 175 
Disjunctive syllogism, 20 
Disks, 84, 91 
Disproving statements, 42-43 
Distributive property of multiplication over 
addition, 5, 49 
Diverging sequence, 192, 203 
Divisibility, 5 
absolute value and, 56 
prime numbers and, 59-62 
in rings, 297 
Division algorithm, 56-59 
extended, 325 
Divisors 
greatest common, 59-60, 320 
proper, 59, 138, 315, 317 
universal side, 327 
of zero, 298 
Domains, 119 
co, 119, 123 
Euclidean, 325-328 
holes in, 212, 213, 214, 226 
integral, 314-319 
multiplicative cancellation in, 315 
natural, 123 
principal ideal, 313, 321-325 
unique factorization, 319-321 
Dominoes proof technique, 78 
“Dot dot dot,” 41, 79, 80 


E 


E1-E3 properties, 92, 93, 94, 97, 98, 101, 114. 
See also Equivalence relations 
Element-chasing proofs, 68 
Elements (Euclid), 109 
Embedded integers 
isomorphically, in rationals, 108 
monomorphically, 
in rings, 335, 338 
Empty mapping, 137 
Empty set, 2, 69, 137, 138, 142 
Epimorphisms, 280, 281, 284, 285, 286, 334, 
335, 336, 340 
Epsilon (€ p), 167-168 
Equality, 3, 45, 91 
of functions, 124 
integer, 103 
of matrices, 290 
rational, 103-104 
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Equality (continued) 

reflexive property of, 3, 45, 91, 92 

in sets, 102-105, 243-244 

symmetric property of, 3, 45, 91,92 

transitive property of, 3, 45, 46, 91, 92 
Equivalence 

logical, 16-17 

mod n, 96-97 

on rational numbers, 103-104 
Equivalence classes, 97-102, 104 
Equivalence relations, 91-97 

E1-E3 properties, 92, 93, 94, 

97, 98, 101, 114 

of functions, 124 
Euclid, 109 
Euclidean domains, 325-328 
Evaluation at a morphism, 336 
Evaluation morphisms, 336 
Even 

integers, 56-57 

permutations, 272 
EVT. See Extreme value theorem 
Exclamations, 12 
Exclusive or, 15 
Existence, unique, 32-33 
Existential quantifier, 29-31, 37-39 
Exponentiation, defining, 85 
Exponents, 85-87 
Extended division algorithm, 325 
Extended real numbers, 221-222, 221f 
Extensions, of rings, 301-306 
Exterior, 175-176 
Extreme value theorem (EVT), 235 


F 


F1-F2 properties (functions), 119-123, 125. 


See also Functions 
Factorial, 87 
Factorization, 89-90 
Factorization domains, unique, 319-321 
Fallacies, 40 
Families, of sets, 71-78 
Fields, 287. See also Rings 
complex numbers, 293 
definition of, 292-293 
K11-K13 properties, 293 
polynomials over, 328-332 
rational numbers, 293 
real numbers, 293 
Finite sets, 135 
cardinality of, 138 
functions between, 146-148 


maximum of, 171-173 
minimum of, 171-173 
preliminary theorems for, 135-137 
subsets of, 136 
Fixed point theorem, 232 
FOIL technique, 84 
Function limit proofs, 219 
Functions, 119-161 
algebraic, 122 
bounded, 207-208 
composition of, 131-133, 132f 
continuous, 224-240 
decreasing, 208 
definition, 119-120 
equality of, 124 
equivalence relation of, 124 
examples, 120-124 
F1-F2 properties, 119-123, 125 
between finite sets, 146-148 
increasing, 208 
inverse, 133-135 
left continuous, 230-231 
maximum, 235 
minimum of, 235 
monotone, 208-210 
nonalgebraic, 215 
nondecreasing, 208 
nonincreasing, 208 
one-to-one, 125-128, 125f 
onto, 125-128 
polynomial, 122, 126 
Q1-Q6 function questions, 132-133 
range, 119, 129 
real-valued, of a real variable, 207-240 
of a real variable, 207-240 
salt and pepper, 216-217, 227 
schematic of, 120f 
sets and, 119 
signum, 217 
strictly decreasing, 208 
strictly increasing, 208 
transcendental, 215 
trigonometric, 215, 215f 
well-defined, 119-120 
Fundamental theorem 
of algebra, 232, 233 
of arithmetic, 89-90 


G 


G1-G2 properties (GLB), 168-169 
G1-GS properties (groups), 247. See also 
Groups 


Index 


Gauchy sequences, 202-206 
Gauss, Carl Friedrich, 79 
Gaussian integers, 303, 316, 
326, 336 
Gauss’s Lemma, 332 
gcd. See Greatest common divisor 
Generated ideals, 309-312 
Generated subgroups, 254-255 
Generator 
monic, 329 
of subgroup, 255, 259 
GLBs. See Greatest lower bounds 
Greatest common divisor (gcd), 
59-60, 320 
Greatest lower bounds (GLBs), 
168-169. See also Least upper bound 
axiom 
Greatness, measuring of, 60 
Greek letters 
€(epsilon), 167-168 
phi, 273 
rho, 273 
Greeks, ancient, 108, 109, 110, 169 
Group morphisms, 280-286 
Groups, 243-286. See also Subgroups 
Abelian, 247 
alternating, 271-273 
definition of, 246-247 
dihedral, 273-274 
G1-G5 properties, 247 
introduction to, 243-252 
order of, 247 
permutation, 268-274 
quotient, 260-268 
symmetric, 269-271 


H 

H1-H3 properties (subgroups), 253-254. 
See also Subgroups 

Half-closed intervals, 174 

Half-open intervals, 174 

Handshakes examples, 79, 83, 155 

Hebrew letter, aleph, 140 

Heine-Borel theorem, 181 

Holes, in domains, 212, 213, 214, 226 

Homomorphisms, 280, 334 

Horizontal asymptote, 220 

Horizontal line test, 209 

“How many ways are there to...?” examples, 
148-151 

Hypothesis, 19 

Hypothetical syllogism, 20 
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Ideals, 296, 306-309 
generated, 309-312 
left, 306 
maximal, 313-314 
prime, 312-313 
principal, 287, 310, 313 
right, 306 
two-sided, 306 
Y1-Y4 properties, 307 
Identity. See Additive identity; Multiplicative 
identity 
Identity element, 245 
Identity mappings, 124 
If-then statements, 18-27 
contrapositive, 21 
converse, 21 
definition, 19 
indeterminates and, 27 
inverse, 21 
negations of, 34-35 
variations on, 21-23 
Image, 119, 120, 120f, 125, 128-130, 129f 
Imperative sentences, 11 
Increasing functions, 208 
Increasing sequences, 186, 197 
Indeterminates, 12, 27 
Index sets, 71-73 
Induction. See also Principle of 
Mathematical Induction 
K1-K2 properties, 89 
proof by, 78 
strong, 89 
Inductive assumption, 81 
Inductive step, 81 
Inequalities, triangle-type, 55-56, 194n, 237 
Infinite, countably, 140, 141 
Infinite order, 258 
Infinite sets, 139-143 
Infinitesimals, 169-170 
Infinity 
convergence of sequences to, 196-197 
limits and, 219-220 
limits at, 220-222 
limits of, 222-224 
negative, 190, 220, 222 
neighborhood of, 196, 220, 221 
orders of, 140 
point at, 221 
positive, 190, 220, 222 
Injection, 126. See also One-to-one functions 
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Index 


Integers, 2 
addition, 103 
closure property and, 103 
composite, 89 
as countably infinite, 141 
equality, 103 
even, 56-57 
extension of whole numbers to, 108 
Gaussian, 303, 316, 326, 336 
isomorphically embedded, in rationals, 108 
monomorphically embedded, in rings, 
335, 338 
multiplication, 103 
odd, 56-57 
polynomials over, 332-334 
as products of primes, 89-90 
rings v., 288 
unbounded, from above, 169 
Z1-Z4 properties, 103 
Integers modulo n, 260-263. See also 
Quotient groups 
Integral domains, 314-319 
Interior, 175-176 
Intermediate value theorem (IVT), 231-233 
Interrogative sentences, 12 
Intersection, 64,75 
Intervals, 167. See also Nested Interval 
Property 
closed, 167 
half-closed, 174 
half-open, 174 
nested, 200, 200f, 202, 202f 
open, 167 
Inverse 
additive, 4, 46, 47 
functions, 133-135 
if-then statement, 21 
multiplicative, 5, 49 
Irrational numbers, 3, 107-111, 140, 143 
Irreducible polynomial rings, 318, 319 
Irreducible polynomials, 329, 330, 332, 334 
Irreflexive property, 116 
Isolated point, 177 
Isomorphically embedded integers, in 
rationals, 108 
Isomorphisms, 280, 281, 282, 285f, 286, 334 
IVT. See Intermediate value theorem 


J1-J2 properties (PMI), 85-86. See also 
Principle of mathematical induction 
Jump discontinuities, 227 


K 


K11-K13 properties (fields), 293 

K1-K2 properties (induction), 89. See also 
Induction; Principle of mathematical 
induction 

Kernel, 283, 284, 285, 338 


L 


L1-L2 properties (least upper bound), 7, 166. 
See also Least upper bound axiom 
Lagrange’s theorem, 267-268. See also 
Cosets 
Language, mathematics and, 11-43 
Least upper bound (LUB) axiom, 6-7, 106, 
111, 165-169 
completeness, NIP, and, 205-206 
L1-L2 properties, 7, 166 
M1-M2 properties, 167-168 
NIP and, 198-199, 201-202 
Left continuous functions, 230-231 
Left cosets, 267, 279, 283 
Left-hand limits, 217 
Left ideals, 306 
Limit points, 177. See also Cluster points 
Limits, 210-224 
continuity v., 225 
definition of, 210-213 
at infinity, 220-222 
of infinity, 222-224 
infinity and, 219-220 
left-hand, 217 
one-sided, 217-218 
right-hand, 217 
of sequences, 192 
sequential, 218-219 
theorems of, 213-217 
Line segments, straight, 109 
Linear combination, 59 
Linear independence, 302n 
Logic, 11-43 
Logical equivalence, 16-17 
of LUB, NIP, and completeness, 205-206 
proving, 41-42 
tautologies and, 23-27 
LUB axiom. See Least upper bound axiom 


M 


M1-M2 properties (LUB), 167-168 
Mappings, 119. See also Relations 
empty, 137 
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identity, 124 
relations v., 124 
well-defined, 122 
Mathematics 
ancient Greeks and, 108, 109, 
110, 169 
language and, 11-43 
Mathematics class metaphor. See PU 
metaphor 
Matrices, 289-291 
Maximal ideals, 313-314 
Maximum 
of finite sets, 171-173 
of function, 235 
Medication/disease/panacea metaphor, 
170-171, 235 
Metric, 165, 204, 234 
Metric space, 204 
Minimum 
of finite sets, 171-173 
of function, 235 
Minus zero, 48 
Mistakes, in proofs, 97 
Modding out, 266, 340, 341 
Modulo n 
equivalence, 96-97 
integers, 260-263 
Modus ponens, 20, 40 
Modus tollens, 20 
Monic generator, 329 
Monomorphically embedded integers, in 
rings, 335, 338 
Monomorphisms, 280, 334, 
335, 336 
Monotone functions, 208-210 
Monotone sequences, 186-187 
Morphisms, 280. See also specific morphisms 
group, 280-286 
ring, 334-339 
Multiplication 
absolute value and, 56 
addition’s link with, 5, 49 
associative property, 4, 49 
binary operation, 63 
closure property, 4, 49 
commutative property, 4, 49 
distributive property of multiplication 
over addition, 5, 49 
division and, 5 
integer, 103 
of matrices, 290 
properties of, 49-51 
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of rational numbers, 104-105 

in rings, 300 

rule of, 148 

well-defined, 4, 49 
Multiplicative cancellation, 49, 51 
in domain, 315 
in rings, 298, 337 
Multiplicative identity, 4,5, 49 
Multiplicative inverses, 5, 49 


N 


N1-N3 properties (norm), 54-55, 
165, 306 
Natural domain, 123 
Natural numbers, 2 
as countably infinite, 140 
unbounded, in reals, 42 
Negations 
of if-then statements, 34-35 
of statements, 13 
of statements, with existential quantifier, 
37-39 
of statements, with universal quantifier, 
36-37 
of AND statements, 33-34 
of tautology, 18 
Negative infinity, 190, 220, 222 
Negative numbers, 5, 6, 51 
e-neighborhood, 172,173,177 
Neighborhoods, 196 
deleted, 177, 210, 213, 214, 216, 217, 218, 
221, 222, 224 
of infinity, 196, 220, 221 
Nested Interval Property (NIP), 166, 
197-202 
completeness, LUB, and, 205-206 
LUB and, 198-199, 201-202 
subsequences and, 199-201 
Nested intervals, 200, 200f, 202, 202f 
Newton, Isaac, 190 
NIP. See Nested Interval Property 
Nonalgebraic functions, 215 
Nondecreasing functions, 208 
Nondecreasing sequences, 186 
Nonincreasing functions, 208 
Nonincreasing sequences, 186 
Nonzero real numbers, 2 
Norm, 54-55, 165, 234, 306 
Dedekind-Hasse, 328 
Normal subgroups, 265, 275-280 
Notation, set, 1-3 
Numbers. See specific number systems 
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O 


O1-03 properties (order relation), 114 
Odd integer, 56-57 
Odd permutations, 272 
One # to zero, 5, 49, 52 
One-sided continuity, 230-231 
One-sided limits, 217-218 
One-to-one correspondence, 127 
One-to-one functions, 125-128, 125f 
Onto functions, 125-128 
Open and closed set, 174 
Open covers, 180, 181, 182, 183, 239 
Open intervals, 167 
Open sets, 172-175, 233-235 
Or, exclusive, 15 
OR statements, 14-16, 33-34 
Order relation, 114, 116, 117 
Ordering properties, S—7, 51-53. See also 
Well-ordering principle 

Order(s) 

of group, 247 

infinite, 258 

of infinity, 140 

n, 258 


P 


P1-P3 properties (partitioning), 98-102 
Panacea/medication/disease metaphor, 
170-171, 235 
Paradoxes, as statements, 12 
Partial ordering, 114 
Partitioning, 98-102, 152-157 
Pascal’s triangle, 157-161, 157f 
Peano’s postulates, 102 
Permutation groups, 268-274 
Permutations, 147, 269, 271-272 
Phi, 273 
Picture, proof by, 67, 247n 
PIDs. See Principal ideal domains 
PMI. See Principle of mathematical 
induction 

Point(s) 

accumulation, 177 

cluster, 176-178 

continuity at a, 224-228 

at infinity, 221 

isolated, 177 

limit, 177 
Polynomial functions, 122, 126 
Polynomial rings, 304-305, 318, 319 
Polynomials 


degree of, 305-306 
irreducible, 329, 330, 332, 334 
over a field, 328-332 
over integers, 332-334 
primitive, 319, 332, 333, 334 
Positive infinity, 190, 220, 222 
Positive integers, unbounded from above, 
169 
Positive numbers, 5, 6, 51 
Positive rationals, countable, 141-142, 141f 
Power sets, 113 
Pre-image sets, 119, 125, 130-131, 130f 
Prestigious University metaphor. See PU 
metaphor 
Prime ideals, 312-313 
Prime numbers, 59-62 
products of, integers as, 89-90 
Primitive polynomials, 319, 332, 333, 334 
Principal ideal domains (PIDs), 313, 
321-325 
Principal ideals, 287, 310, 313 
Principle of mathematical induction (PMI), 
78-91 
Principle of zero products, 51, 106, 298, 
314, 315 
Projection morphisms, 281 
Proofs 
by contradiction, 42, 70 
by contrapositive, 41, 69 
direct, 40-41 
disproving statements, 42-43 
dominoes and, 78 
element-chasing, 68 
function limit, 219 
by induction, 78 
of logically equivalent statements, 41-42 
mistakes in, 97 
by picture, 67, 247n 
of set properties, 67-71 
writing, 40-43 
Proper divisor, 59, 138, 315, 317 
Proper subgroups, 252 
Proper subrings, 294 
Proper subsets, 2 


PU (Prestigious University) metaphor, 73-78 


Pythagoras, 109 
Pythagorean theorem, 108, 110, 110f 


Q1-Q6 function questions, 132-133 


Quantifiers. See Existential quantifier; 
Universal quantifier 
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Quaternions, 248, 253, 254, 257 
Quilt story, 180 

Quotient, 57 

Quotient groups, 260-268 
Quotient rings, 339-344 


R 
R1-R10 properties (rings), 288-289, 339 
Random House Unabridged Dictionary, 94 
Range, 119, 129 
Rational equality, 103-104 
Rational numbers, 2 
addition of, 104-105 
building, 102-105 
equivalence classes of, 104 
equivalence on, 103-104 
as field, 293 
isomorphically embedded integers in, 108 
multiplication of, 104-105 
positive, countable, 141-142, 141f 
Real numbers, 2, 165-183 
algebraic properties of, 45-51 
assumptions about, 3-7 
base 10 decimal representations of, 7 
comparing, 5-7 
as continuum, 108 
convergence of sequences to, 190-196 
extended, 221-222, 221f 
as field, 293 
nonzero, 2 
ordering properties, 5—7, 51-53 
properties of, 45-62 
reflexive property of equality in, 
3,45, 91 
roots of, 7, 106-107 
sign of, 51 
unbounded natural numbers in, 42 
uncountable, 143 
Real-valued functions, of a real variable, 
207-240 
Recursive sequence, 85, 87 
Reflexive property of equality 
in real numbers, 3, 45, 91 
on sets, 91,92 
Reflexive property (T1, W1), 117 
Relations, 92, 111-118, 124. See also 
Equivalence relations; Order relation 
Relatively prime numbers, 61 
Remainder, 57 
Removable discontinuities, 227 
Repeating 9 pattern, 7, 143n 
Rho, 273 
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Right continuous functions, 230-231 
Right cosets, 264, 266, 267 
Right-hand limits, 217 
Right ideals, 306 
Rigid square, 273, 273f, 278, 278f 
Ring elements, adjoining roots of, 301-304 
Ring morphisms, 334-339 
Rings, 287-344 
commutative, 288, 301 
definition of, 287-289 
divisibility in, 297 
extensions of, 301-306 
integers v., 288 
matrix, 289-291 
monomorphically embedded integers in, 
335, 338 
multiplication in, 300 
multiplicative cancellation in, 298, 337 
polynomial, 304-305, 318, 319 
properties of, 296-301 
quotient, 339-344 
R1-R10 properties, 288-289, 339 
subrings, 287, 293-295 
unit of, 297 
Roots, of real numbers, 7, 106-107 
Rules of inference, 40 


S 


S1-S3 properties (strict order relation), 116 
S1-S4 properties (subrings), 293 
Salt and pepper function, 216-217, 227 
Sandwich theorem, 196, 201, 214, 221 
Schematic, of function, 120f 
Sentences, 11-12. See also Statements 
Sequences, 185-206 
bounded, 187-190 
Cauchy, 202-206, 203f, 216 
convergences of, 190-197 
diverging, 192, 203 
monotone, 186-187 
Sequential limits, 218-219 
Series solution, 159 
Set cardinality. See Cardinality 
Sets, 1, 63-118. See also specific sets 
binary operations and, 63-65, 243 
bounded, 166 
Cartesian products of, 111-112, 144-151 
closed, 172-175 
closed and open, 174 
closure of, 178-180 
compact, uniform continuity and, 239-240 
complement of, 3 
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Cartesian products of (continued) 
connectedness of, 175 
context of, 3 
continuity on, 228-230 
countable, 142 
difference of, 64 
disjoint, 67, 71 
elements of, 1-2 
empty, 2, 69, 137, 138, 142 
equality in, 102-105, 243-244 
families of, 71-78 
finite, 135-139 
functions and, 119 
index, 71-73 
infinite, 139-143 
intersection of, 64 
notation, 1-3 
open, 172-175 
open and closed, 174 
partitioning of, 98-102, 152-157 
power, 113 
pre-image, 119, 125, 130-131, 130f 
properties, proofs of, 67-71 
reflexive property of equality on, 91, 92 
subsets, 2, 66 
supersets, 2 
symmetric difference of, 65 
symmetric property of equality on, 
91,92 
terminology, 1-3, 63-67 
transitive property of equality on, 91, 92 
types of, 2 
union, 64 
well-defined operations on, 70,71 
Side divisor, universal, 327 
Sign, of real numbers, 51 
Signum function, 217 
Sine, 215, 216 
SPMIL. See Strong principle of mathematical 
induction 
Square, rigid, 273, 273f, 278, 278f 
Statements, 11-13. See also If-then 
statements 
AND, 13-14 
AND, negation of, 33-34 
definition, 12 
disproving, 42-43 
with existential quantifier, negation of, 
37-39 
indeterminates and, 12 
negation of, 13 
OR, 14-16 


OR, negation of, 33-34 

paradoxes as, 12 

stronger, 24 

with universal quantifier, negation of, 

36-37 

weaker, 24 
Straight line segments, 109 
Strict order relation, 116 
Strictly decreasing functions, 208 
Strictly decreasing sequences, 186 
Strictly increasing functions, 208 
Strictly increasing sequences, 186 
Strong induction, 89-91 
Strong principle of mathematical induction 

(SPMI), 89 

Stronger statements, 24-27, 67 
Subcovers, 180, 181 
Subfamilies, 77 
Subgroups, 252-259 

cyclic, 255-259 

generated, 254-255 

normal, 265, 275-280 
Subrings, 287, 293-295. See also Ideals 
Subsequences, 199-201 
Subsets, 2, 66 

of finite sets, 136 

proper, 2 
Subtraction, 5 

absolute value and, 55-56 

addition and, 5 
Successors, 102 
Sum rule, 154 
Supersets, 2 
Surjection, 125. See also Onto functions 
Syllogism, 20 
Symmetric difference, of sets, 65 
Symmetric group, 269-271 
Symmetric property of equality 

in real numbers, 3, 45, 91 

on sets, 91, 92 


T 


T1-T3 properties (topology), 235 
T1-T4 properties (total order relation), 117 
Tangent, 215, 216 
Tautologies, 18 
logical equivalence and, 23-27 
theorems as, 40, 67-68 
Theorems. See specific theorems 
Topology, 172, 234, 235 
Total order relation, 117 
Totality property, 117 
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Tower of Hanoi, 84 
Transcendental functions, 215 
Transcendental numbers, 304 
Transitive property of equality 

in real numbers, 3, 45, 46, 91 

on sets, 91,92 
Transitive property (S3,T3, W3), 116-117 
Transpositions, 271-272 
Triangle, Pascal’s. See Pascal’s triangle 
Triangle-type inequalities, 55-56, 

194n, 237 

Trichotomy law, 6, 51 
Trigonometric functions, 215, 215f 
Trivial morphisms, 281, 334 
Trivial subgroups, 253 
Trivial subrings, 294 
Truth table, 13 
Two-sided ideals, 306 
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U1-U3 properties (generated ideals), 
309-312 
U1-U3 properties (generated subgroups), 
254-255 
UFDs. See Unique factorization domains 
Unbounded natural numbers, in reals, 42 
Unbounded positive integers, 
from above, 169 

nbounded sequences, 187-188 
ncountable irrationals, 140, 143 
ncountable real numbers, 143 
niform continuity, 235-240 
nion 
disjoint, 135n, 175 
over family of sets, 75 
of sets, 64 
Unique existence, 32-33, 47 

multiplicative inverse, 49 

theorem, 56 
Unique factorization domains (UFDs), 

319-321 

Uniqueness 
additive identity, 47 
additive inverse, 47 
of factorization, 89 
gcd(a,b), 60-61 
of image, 120, 120f, 125 
multiplicative identity, 49 
quotient, 57 
remainder, 57 
theorems, 47 
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Unit, of ring, 297 

Unit circle, 215, 216 

Unity element, 288 

Universal quantifier, 28-29, 36-37 
Universal set, 3 

Universal side divisor, 327 

Upper bounds, 6-7 


V 


Valuation, 326, 327, 328 
Venn diagrams, 63, 63f, 64f, 65f, 66f, 67 
Vertical asymptote, 220 


W 


W1-W4 properties (well order relation), 
117 
Weaker statements, 24 
Well defined 
addition, 4, 46 
addition of rational numbers, 105 
binary operation, 244 
cardinality of finite set, 138 
functions, 119-120 
mapping, 122 
multiplication, 4, 49 
multiplication of rational numbers, 105 
operations on sets, 70, 71 
Well order relation, 117 
Well-ordering principle (WOP), 6, 57-58. 
See also Principle of Mathematical 
Induction 
as axioms, 78 
PMI from, 78, 79, 80 
Wellington, 101 
Whole numbers, 2, 102 
construction of, 102 
as countably infinite, 140 
extension of, to integers, 108 
as successors, 102 
WOP. See Well-ordering principle 
Wrangell, 100 


X 


X1-X3 properties (roots), 106-107. See also 
Roots, of real numbers 


Y 


Y1-Y4 properties (ideals), 307. See also 
Ideals 
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Z1-ZA4 properties (integer assumptions), 103 
Zermelo, Ernst, 118 
Zero, 48 


characteristic, 301 


divisors of, 298 
infinitesimals v., 169 
minus, 48 
one # to, 5, 49, 52 
Zero products, principle of, 51, 106, 298, 
314, 315 


